Airtable + GPT: Prototyping a Light-weight RAG System with No-Code Instruments

Picture by Editor | ChatGPT

# Introduction

Prepared for a sensible walkthrough with little to no code concerned, relying on the method you select? This tutorial reveals easy methods to tie collectively two formidable instruments — OpenAI‘s GPT fashions and the Airtable cloud-based database — to prototype a easy, toy-sized retrieval-augmented era (RAG) system. The system accepts question-based prompts and makes use of textual content information saved in Airtable because the data base to supply grounded solutions. In the event you’re unfamiliar with RAG programs, or desire a refresher, don’t miss this article sequence on understanding RAG.

# The Components

To observe this tutorial your self, you will want:

An Airtable account with a base created in your workspace.
An OpenAI API key (ideally a paid plan for flexibility in mannequin alternative).
A Pipedream account — an orchestration and automation app that permits experimentation underneath a free tier (with limits on day by day runs).

# The Retrieval-Augmented Era Recipe

The method to construct our RAG system isn’t purely linear, and a few steps will be taken in several methods. Relying in your stage of programming data, chances are you’ll go for a code-free or practically code-free method, or create the workflow programmatically.

In essence, we’ll create an orchestration workflow consisting of three components, utilizing Pipedream:

Set off: just like an internet service request, this ingredient initiates an motion move that passes by way of the following components within the workflow. As soon as deployed, that is the place you specify the request, i.e., the consumer immediate for our prototype RAG system.
Airtable block: establishes a connection to our Airtable base and particular desk to make use of its information because the RAG system’s data base. We’ll add some textual content information to it shortly inside Airtable.
OpenAI block: connects to OpenAI’s GPT-based language fashions utilizing an API key and passes the consumer immediate alongside the context (retrieved Airtable information) to the mannequin to acquire a response.

However first, we have to create a brand new desk in our Airtable base containing textual content information. For this instance, I created an empty desk with three fields (ID: one-line textual content, Supply: one-line textual content, Content material: lengthy textual content), after which imported information from this publicly obtainable small dataset containing textual content with primary data about Asian nations. Use the CSV and hyperlink choices to import the information into the desk. Extra details about creating tables and importing information is in this text.

After getting your data base prepared in Airtable, let’s swap to Pipedream for the remainder of the method! Register if you happen to haven’t carried out so but, and log in. There are a number of methods to provoke a workflow. One among them is by describing what you wish to create to Pipedream’s AI agent, which is intuitive however could lead to errors needing fixes, as we’ll illustrate shortly. Pipedream has two parallel UIs: one for manually constructing and enhancing workflows, and one for conversations with its AI agent, known as String.

Pipedream’s AI conversational interface (String)

In the event you go for letting String construct an approximate model of your required workflow, describe clearly what to construct, with a immediate which will look one thing like:

I would prefer to orchestrate a easy toy RAG utility that makes use of an Airtable desk containing textual content, and OpenAI’s GPT fashions (I’ve a paid API key).

Pipedream’s language mannequin could return a solution so that you can approve (at your individual danger), like this:

I will create a easy RAG (Retrieval-Augmented Era) agent that connects your Airtable desk with OpenAI’s GPT fashions. Here is what I will construct:

Arrange a set off to simply accept queries (through webhook or handbook execution)
Search and retrieve related textual content out of your Airtable desk based mostly on the question
Use the retrieved textual content as context for OpenAI GPT to generate an knowledgeable response
Return the AI-generated reply that is grounded in your Airtable information
This gives you a primary RAG system the place you possibly can ask questions and get solutions based mostly on the content material saved in your Airtable, enhanced by GPT’s reasoning capabilities.

For the sake of experimentation and “seeing what occurs,” approve the instructed plan of action to acquire a three-block workflow just like the one within the picture proven above.

Since this isn’t the core Pipedream editor UI, swap to it by clicking “Open Pipedream” within the higher proper nook. A brand new tab will open with the precise Pipedream workflow editor.

For the set off block, a URL is routinely generated with a syntax just like this one I acquired for mine: https://eoupscprutt37xx.m.pipedream.internet. Click on it and, within the settings pane that opens on the right-hand facet, guarantee the primary couple of choices are set to “Full HTTP request” and “Return a static response.”

For the second block (Airtable motion) there could also be a bit work to do. First, hook up with your Airtable base. In the event you’re working in the identical browser, this is perhaps easy: register to Airtable from the pop-up window that seems after clicking “Join new account,” then comply with the on-screen steps to specify the bottom and desk to entry:

Pipedream workflow editor: connecting to Airtable

Right here comes the tough half (and a purpose I deliberately left an imperfect immediate earlier when asking the AI agent to construct the skeleton workflow): there are a number of varieties of Airtable actions to select from, and the precise one we’d like for a RAG-style retrieval mechanism is “Checklist data.” Likelihood is, this isn’t the motion you see within the second block of your workflow. If that’s the case, take away it, add a brand new block within the center, choose “Airtable,” and select “Checklist data.” Then reconnect to your desk and take a look at the connection to make sure it really works.

That is what a efficiently examined connection appears like:

Pipedream workflow editor: testing connection to Airtable

Final, arrange and configure OpenAI entry to GPT. Preserve your API key useful. In case your third block’s secondary label isn’t “Generate RAG response,” take away the block and exchange it with a brand new OpenAI block with this subtype.

Begin by establishing an OpenAI connection utilizing your API key:

Establishing OpenAI connection

The consumer query area ought to be set as {{ steps.set off.occasion.physique.take a look at }}, and the data base data (your textual content “paperwork” for RAG from Airtable) have to be set as {{ steps.list_records.$return_value }}.

You possibly can hold the remaining as default and take a look at, however chances are you’ll encounter parsing errors frequent to those sorts of workflows, prompting you to leap again to String for assist and computerized fixes utilizing the AI agent. Alternatively, you possibly can instantly copy and paste the next into the OpenAI element’s code area on the backside for a sturdy answer:

import openai from "@pipedream/openai"

export default defineComponent({
  title: "Generate RAG Response",
  description: "Generate a response utilizing OpenAI based mostly on consumer query and Airtable data base content material",
  sort: "motion",
  props: {
    openai,
    mannequin: {
      propDefinition: [
        openai,
        "chatCompletionModelId",
      ],
    },
    query: {
      sort: "string",
      label: "Person Query",
      description: "The query from the webhook set off",
      default: "{{ steps.set off.occasion.physique.take a look at }}",
    },
    knowledgeBaseRecords: {
      sort: "any",
      label: "Information Base Information",
      description: "The Airtable data containing the data base content material",
      default: "{{ steps.list_records.$return_value }}",
    },
  },
  async run({ $ }) {
    // Extract consumer query
    const userQuestion = this.query;
    
    if (!userQuestion) {
      throw new Error("No query offered from the set off");
    }

    // Course of Airtable data to extract content material
    const data = this.knowledgeBaseRecords;
    let knowledgeBaseContent = "";
    
    if (data && Array.isArray(data)) {
      knowledgeBaseContent = data
        .map(file => {
          // Extract content material from fields.Content material
          const content material = file.fields?.Content material;
          return content material ? content material.trim() : "";
        })
        .filter(content material => content material.size > 0) // Take away empty content material
        .be part of("nn---nn"); // Separate totally different data base entries
    }

    if (!knowledgeBaseContent) {
      throw new Error("No content material present in data base data");
    }

    // Create system immediate with data base context
    const systemPrompt = `You're a useful assistant that solutions questions based mostly on the offered data base. Use solely the knowledge from the data base beneath to reply questions. If the knowledge just isn't obtainable within the data base, please say so.

Information Base:
${knowledgeBaseContent}

Directions:
- Reply based mostly solely on the offered data base content material
- Be correct and concise
- If the reply just isn't within the data base, clearly state that the knowledge just isn't obtainable
- Cite related components of the data base when attainable`;

    // Put together messages for OpenAI
    const messages = [
      {
        role: "system",
        content: systemPrompt,
      },
      {
        role: "user",
        content: userQuestion,
      },
    ];

    // Name OpenAI chat completion
    const response = await this.openai.createChatCompletion({
      $,
      information: {
        mannequin: this.mannequin,
        messages: messages,
        temperature: 0.7,
        max_tokens: 1000,
      },
    });

    const generatedResponse = response.generated_message?.content material;

    if (!generatedResponse) {
      throw new Error("Didn't generate response from OpenAI");
    }

    // Export abstract for consumer suggestions
    $.export("$abstract", `Generated RAG response for query: "${userQuestion.substring(0, 50)}${userQuestion.size > 50 ? '...' : ''}"`);

    // Return the generated response
    return {
      query: userQuestion,
      response: generatedResponse,
      model_used: this.mannequin,
      knowledge_base_entries: data ? data.size : 0,
      full_openai_response: response,
    };
  },
})

If no errors or warnings seem, you ought to be prepared to check and deploy. Deploy first, after which take a look at by passing a consumer question like this within the newly opened deployment tab:

Testing deployed workflow with a prompt asking what is the capital of Japan

Testing deployed workflow with a immediate asking what’s the capital of Japan

If the request is dealt with and all the pieces runs appropriately, scroll right down to see the response returned by the GPT mannequin accessed within the final stage of the workflow:

GPT mannequin response

Nicely carried out! This response is grounded within the data base we in-built Airtable, so we now have a easy prototype RAG system that mixes Airtable and GPT fashions through Pipedream.

# Wrapping Up

This text confirmed easy methods to construct, with little or no coding, an orchestration workflow to prototype a RAG system that makes use of Airtable textual content databases because the data base for retrieval and OpenAI’s GPT fashions for response era. Pipedream permits defining orchestration workflows programmatically, manually, or aided by its conversational AI agent. Via the writer’s experiences, we succinctly showcased the professionals and cons of every method.

Iván Palomares Carrascosa is a frontrunner, author, speaker, and adviser in AI, machine studying, deep studying & LLMs. He trains and guides others in harnessing AI in the true world.

Main Menu

What's Hot

Customers, Progress, and International Traits

CISA Points Alert on Wing FTP Server Vulnerability Utilized in Assaults

Nvidia's DGX Station is a desktop supercomputer that runs trillion-parameter AI fashions with out the cloud

Airtable + GPT: Prototyping a Light-weight RAG System with No-Code Instruments

High 7 Free Machine Studying Programs with Certificates

AWS and NVIDIA deepen strategic collaboration to speed up AI from pilot to manufacturing

5 Vital Shifts D&A Leaders Should Make to Drive Analytics and AI Success

Evaluating the Finest AI Video Mills for Social Media

Utilizing AI To Repair The Innovation Drawback: The Three Step Resolution

Midjourney V7: Quicker, smarter, extra reasonable

Meta resumes AI coaching utilizing EU person knowledge

Customers, Progress, and International Traits

CISA Points Alert on Wing FTP Server Vulnerability Utilized in Assaults

Nvidia's DGX Station is a desktop supercomputer that runs trillion-parameter AI fashions with out the cloud

New AI Management Guidelines with Emily Discipline, CPO of LPL Monetary

Main Menu

Subscribe to Updates

What's Hot

Airtable + GPT: Prototyping a Light-weight RAG System with No-Code Instruments

# Introduction

# The Components

# The Retrieval-Augmented Era Recipe

# Wrapping Up

Related Posts