

Learn how to build a real AI agent, not just a chatbot, that can think, act, and adapt across multiple environments. This guide by Will Sentance, Chief AI Officer of Codesmith, breaks down the four-stage agent loop (Input → Reasoning → Action → Output) and shares best practices for logging, error handling, and building resilient multi-runtime systems.
When people think about AI, they often imagine a single model call: you send a prompt to an LLM, and you get a response back. That’s powerful, but it’s not an agent.
An AI agent is more than an API call. It's a system that can reason, act, and adapt across multiple runtimes. It’s about chaining logic, managing actions, handling errors, and building workflows that can actually execute in the real world.
In this article, I’ll break down how to build an AI agent from the ground up, based on the principles I use in practice.
Every agent follows the same four-stage loop:
1. Input: the agent receives a task or new information.
2. Reasoning: the LLM decides what to do next.
3. Action: the agent executes a tool call or a step of work.
4. Output: the agent returns a result, or feeds what it observed back into reasoning.
This flow repeats. An agent may reason, act, and observe several times before it decides it has reached a final answer.
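Here's a minimal sketch of that loop in Python; `llm.decide` and the `tools` registry are illustrative stand-ins for whatever model wrapper and tools you actually wire up:

```python
# A minimal agent loop -- illustrative only; `llm.decide` stands in for
# whatever wrapper you write around your model call.
def run_agent(user_input, llm, tools, max_steps=5):
    observation = user_input                  # 1. Input
    for _ in range(max_steps):
        decision = llm.decide(observation)    # 2. Reasoning: choose the next action
        if decision.is_final:
            return decision.answer            # 4. Output: a final answer was reached
        tool = tools[decision.tool]           # 3. Action: run the chosen tool...
        observation = tool(decision.args)     # ...then observe the result and loop
    return "Stopped: reached the step limit without a final answer."
```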
When you write plain JavaScript or Python, debugging is simple: console.log or print tells you what’s going on. With an agent, that doesn’t work.
Here’s why: agents often run across five or six different runtimes. Some parts execute locally, others in the cloud, others inside a third-party service. Without proper logging, you have no visibility into what’s happening.
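One way to get that visibility (a sketch, not a prescription) is to have every runtime append structured, timestamped entries to a shared log, tagged with a run ID so a single request can be traced across environments:

```python
import json
import time
import uuid
from pathlib import Path

LOG_PATH = Path("agent.log")
RUN_ID = uuid.uuid4().hex[:8]  # one ID shared by every step of a single run

def log_event(runtime, step, **details):
    # One JSON line per event: easy to grep, easy to correlate across runtimes
    entry = {"ts": time.time(), "run": RUN_ID, "runtime": runtime, "step": step, **details}
    with LOG_PATH.open("a") as f:
        f.write(json.dumps(entry) + "\n")

log_event("local", "screenshot_detected", path="/tmp/shot.png")
```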
Agents will fail. APIs timeout, models hallucinate, and services crash. The goal is not to prevent failure, but to design for resilience.
Error handling strategies for agents:
- Wrap every tool call in try/except (or the equivalent in your language) so one failure can't crash the whole run.
- Retry transient failures such as timeouts and rate limits, backing off between attempts.
- Return a clear error message instead of raising, so the agent can reason about what went wrong.
- Log every failure with enough context to reconstruct what happened later.
When an agent fails, it should degrade gracefully instead of collapsing.
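As a sketch of what that looks like in practice, here is a retry wrapper (the helper name and defaults are assumptions) that backs off on transient failures and returns an error object rather than raising:

```python
import logging
import time

logging.basicConfig(filename="agent.log", level=logging.INFO)

def call_with_retries(fn, *args, attempts=3, base_delay=1.0, **kwargs):
    """Retry transient failures with exponential backoff, then degrade gracefully."""
    for attempt in range(1, attempts + 1):
        try:
            return {"ok": True, "result": fn(*args, **kwargs)}
        except Exception as exc:
            logging.warning("Attempt %d/%d failed: %s", attempt, attempts, exc)
            if attempt == attempts:
                return {"ok": False, "error": str(exc)}  # degrade, don't crash
            time.sleep(base_delay * 2 ** (attempt - 1))  # back off: 1s, 2s, 4s, ...
```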
I’m often asked: should you build agents in Python or JavaScript?
In most cases, Python is the right default, but don’t ignore JS if your agent belongs in the browser.
Building blocks of a hands-on agent
To make this practical, here are the components you need to wire up:
The LLM is the reasoning layer. It decides what action to take next. Think of it as the brain, not the whole body.
Agents need “hands” to interact with the world. Tools can be API calls, shell or Python scripts, file operations, or third-party services (in the walkthrough below: Automator, a Python script, the OpenAI API, and Google Apps Script).
After taking an action, the agent must “see” what happened. For example: after calling an API, it reads the response (status code, payload, or error text) before deciding its next step.
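As a sketch, an HTTP observation step might look like this (the function name and return shape are illustrative):

```python
import requests

def observe_http(url):
    # The agent "sees" the outcome: the body on success, a clear error otherwise
    try:
        resp = requests.get(url, timeout=10)
    except requests.RequestException as exc:
        return {"ok": False, "error": str(exc)}
    if resp.ok:
        return {"ok": True, "data": resp.text}
    return {"ok": False, "error": f"HTTP {resp.status_code}: {resp.text[:200]}"}
```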
Integrate logging at every step and wrap tool calls with try/except or the equivalent, as in the retry wrapper sketched earlier. Every action should either return a result or a clear error message.
Finally, the agent delivers a result: an answer, a file, or a completed automated step.
You can build all of this from scratch, but agent frameworks speed things up.
These tools don’t replace the fundamentals, but they make complex agents easier to manage.
Once you’ve built a working prototype, you’ll want to extend it: add more tools, adopt a framework, or experiment with private models and deeper automation.
This walkthrough is based on Codesmith’s recent workshop, “Build Your First AI Agent.” It’s not meant to replace the workshop, but instead to give you a clear, step-by-step look at the overall workflow and logic that Will Sentance follows. If you haven’t already, we highly encourage you to check out the full session on YouTube and code along with Will! After all, the best way to learn this stuff is by going hands-on!
In this walkthrough, we’re going to trace the full journey of how an AI agent can take a screenshot, extract all of the meaningful information from it using OpenAI’s API, and then automatically create a Google Calendar event.
By the end, you should have a high-level understanding of how each of the different components (Automator, Python, OpenAI, and Google Apps Script) all fit together to form an end-to-end AI-powered automation!
Setting up Automator: First, we’ll create an Automator workflow on your computer that runs a Python script whenever a new screenshot is taken and added to a folder.

Building the Screenshot Agent: Next, in VS Code, we’ll write a Python script (screenshot_agent.py) that logs each new screenshot and then triggers the next step in the workflow.
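Here’s a sketch of what screenshot_agent.py could look like; the helper name and the exact hand-off to the next script are assumptions, so the workshop’s file may differ:

```python
# screenshot_agent.py -- a sketch; names and structure are assumptions
import subprocess
import sys
from datetime import datetime

LOG_FILE = "screenshot_agent.log"

def log_message(message):
    # Append a timestamped line to the shared log file
    with open(LOG_FILE, "a") as log:
        log.write(f"{datetime.now().isoformat()} {message}\n")

if __name__ == "__main__":
    screenshot_path = sys.argv[1]  # Automator passes in the new file's path
    log_message(f"New screenshot detected: {screenshot_path}")
    # Trigger the next step in the workflow: extracting event data via OpenAI
    subprocess.run([sys.executable, "extract_event.py", screenshot_path], check=True)
```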

After we take a test screenshot, we can see our message logged in screenshot_agent.log! This log file will get much more detailed as we add more steps to our workflow and log as we go, but for now we just see a log of our first screenshot.

Describing the Image with OpenAI: Of course, our app needs to do much more than just log a screenshot! We’ll need to send the screenshot to the OpenAI API. Note this image only shows part of the script; the full file also includes the API request logic, error handling, and response parsing that turn the model’s output into usable event data. Also note: you should ALWAYS store your API keys in a safe environment like a .env file, but for clarity’s sake we hard-coded it into this file.
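For reference, here’s a simplified sketch of that request (the model name and function are assumptions, and the real file does more around it):

```python
# Sketch: send the screenshot to a vision-capable model and get text back
import base64
import os

from openai import OpenAI

client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])  # loaded from .env in practice

def describe_screenshot(image_path, extract_prompt):
    # Base64-encode the image so it can travel inside the request body
    with open(image_path, "rb") as f:
        b64 = base64.b64encode(f.read()).decode("utf-8")
    response = client.chat.completions.create(
        model="gpt-4o",  # assumption: any vision-capable model works here
        messages=[{
            "role": "user",
            "content": [
                {"type": "text", "text": extract_prompt},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{b64}"}},
            ],
        }],
    )
    return response.choices[0].message.content
```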

Below is our extract_prompt. We’ll pass this prompt to the OpenAI API. It tells OpenAI exactly what to do by defining the task, output format, and rules it has to follow. This will guide the model to extract only the key details from the screenshot, make reasonable assumptions when information is missing (like an event’s end time), and return everything in a clean, structured JSON format.
This structured output is what we will pass along to our Google Script for creating a calendar event!
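To make that concrete, here’s the general shape such a prompt could take; this is a reconstruction based on the description above, not the workshop’s exact wording:

```python
# A reconstruction of an extract_prompt with a task, output format, and rules
extract_prompt = """You extract calendar event details from a screenshot.

Task: Identify the single event shown in the image.
Output format: Return ONLY valid JSON with these keys:
  {"title": string, "start": ISO-8601 datetime, "end": ISO-8601 datetime,
   "location": string, "description": string}
Rules:
- If the end time is missing, assume the event lasts one hour.
- If a field is unknown, use an empty string.
- Do not include any text outside the JSON object."""
```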

Remember, logging is important! Below is a simple helper function that does the same thing as the one we saw in screenshot_agent.py: it appends a timestamped message to our log (screenshot_agent.log). It is invoked throughout extract_event.py to record each step of the communication with the OpenAI API.
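A sketch of that helper (the name log_message is an assumption):

```python
from datetime import datetime

LOG_FILE = "screenshot_agent.log"

def log_message(message):
    # Append the current time plus a message to the shared log file
    with open(LOG_FILE, "a") as log:
        log.write(f"{datetime.now().isoformat()} {message}\n")
```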

Creating the Calendar Event: We’ll pass our structured JSON data to a Google Apps Script endpoint that uses the Google Calendar API to automatically create an event.
Note: Just as our OpenAI API key should be stored in a secure environment variable, our WEB_APP_URL (which connects the app to Google Apps Script) should also be kept safe instead of hard-coded in the script.
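The hand-off itself can be as small as a single POST request. Here’s a sketch, with the function and variable names assumed:

```python
import os

import requests

WEB_APP_URL = os.environ["WEB_APP_URL"]  # the Apps Script deployment URL, kept out of source

def create_calendar_event(event_data):
    # POST the structured JSON from OpenAI to our Google Apps Script endpoint
    resp = requests.post(WEB_APP_URL, json=event_data, timeout=30)
    resp.raise_for_status()  # surface HTTP errors instead of failing silently
    return resp.text
```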

The final link in our agent’s chain is the Google Apps Script. This will do two main things: parse the JSON payload our Python script sends it, and use the Calendar service to create the event in Google Calendar.
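Because Apps Script runs JavaScript, this one sketch is in that language; the field names assume the JSON shape described earlier:

```javascript
// Google Apps Script sketch -- field names assume the JSON shape described above
function doPost(e) {
  // 1. Parse the JSON payload POSTed by our Python script
  var data = JSON.parse(e.postData.contents);

  // 2. Create the event in the user's default Google Calendar
  CalendarApp.getDefaultCalendar().createEvent(
    data.title,
    new Date(data.start),
    new Date(data.end),
    { description: data.description, location: data.location }
  );

  return ContentService.createTextOutput("Event created: " + data.title);
}
```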

Testing the Full Workflow: Finally, we’ll see all the pieces come together! We take a screenshot of an event on codesmith.io and watch a brand-new calendar event appear in Google Calendar, with every step logged in screenshot_agent.log.

Below, we see each step in our workflow reflected in screenshot_agent.log:

And now, the moment of truth: the event shows up in Google Calendar!

By now, you should have a high-level understanding of how we leveraged Automator, the OpenAI API, Google Apps Script, and a logging system to create a working AI agent! To get a more in-depth understanding of this process, be sure to check out my workshop, “Build Your First AI Agent”.
To follow along using the source code, check out the GitHub repository.
An AI agent is not a toy. It’s a distributed system that requires architecture, resilience, and care. Start small: wire up the four-step loop (input → reasoning → action → output). Then add logging, error handling, and a couple of tools. From there, you can expand into frameworks, private models, and advanced automation.
The key is to remember: the LLM is the brain, not the whole system. The agent comes alive when you connect reasoning, tools, and feedback into a loop that can actually get things done. That’s how you build an AI agent. Not just a chatbot, but a system that thinks, acts, and adapts.
Check our free events to join more of our practical workshops.

Will Sentance is the co-founder of Codesmith, where he’s driven the mission to equip diverse learners with the skills and mental models to thrive in software and AI.
