Build an AI-Powered Meeting Summarizer with Twilio Voice, SendGrid, OpenAI, and Python
Time to read:
Build an AI-Powered Meeting Summarizer with Python, Twilio Voice, and OpenAI
Introduction
Have you ever ended a conference call and immediately forgotten key points that were discussed? Or wished you had an easy way to share meeting notes with team members who couldn't attend? In this tutorial, you'll learn how to build an automated AI-Powered meeting summarizer using Twilio Programmable Voice, Twilio SendGrid, OpenAI, and Python. The system records the conference call conversations, transcribes them using OpenAI's Whisper, creates summaries with GPT-4, and automatically emails results to participants through SendGrid.
At the end of this tutorial you'll have:
- A Flask server that handles conference calls and recording webhooks using Twilio Voice APIs.
- An automated system that processes conference recordings through OpenAI's Whisper for transcription.
- A summarization pipeline using GPT-4 to generate meeting summary notes.
- An email delivery implementation using Twilio SendGrid to send summaries to participants.
Prerequisites
To follow along with this tutorial, you should have:
- Python 3.9+. (Version 3.9.13 or newer recommended). You can download it here.
- A Twilio account. If you are new to Twilio, click here to create a free account.
- A Twilio number with Voice capabilities. You can set up or configure a trial Twilio phone number to receive and respond to voice calls here.
- A Twilio SendGrid account. Sign up for a free account to send emails.
- Two verified phone numbers (Caller ID) for testing conference calls. See how to add phone numbers to verified caller ID on your Twilio account here.
- An OpenAI account and API key. Sign up for one here.
- (Optional) A Render account to deploy your server for testing. Create a Render account here. Alternatively, another tunneling solution like ngrok that can expose your local server to the internet can be used.
Make sure you have all the above ready before proceeding. All required package dependencies will be covered in the project setup section.
Setup the Voice Meeting Summarizer Python Project
Step 1: Initialize the Project
First, create a new directory for your project and set up a Python virtual environment. This will keep your project dependencies isolated from other Python projects on your system.
Create a new project directory and initialize it in your command line by running this block of code:
Once your virtual environment has been activated, create the basic project structure:
These commands will create four empty files: a .env file for environment variables, requirements.txt for package dependencies, app.py for our main application code and gitignore for excluding files that should not be pushed to the repository.
Step 2: Install Dependencies
To install the dependencies needed for the project, open the requirements.txt file and add the necessary packages:
Here’s what each package does:
- flaskis our web framework for handling HTTP requests and webhooks.
- python-dotenvhelps manage environment variables for our API keys.
- twilioprovides the SDK for working with Twilio Voice and conferences.
- openaiallows us to use Whisper for transcription and GPT-4 for summaries.
- sendgridhandles email delivery through Twilio SendGrid.
- requestsmanages HTTP requests for downloading recordings.
- httpxis a used for making HTTP requests with async support.
Install these dependencies with pip using the below command:
This command reads the requirements.txt file and installs all listed packages into your virtual environment.
Finally, set up your environment variables in the . env file. These will store sensitive API keys and configurations:
Step 3: Write the Server Code
After setting up project files and installing our dependencies, you'll create the server code step by step, starting with the basic setup and gradually adding more functionality as you progress.
Step 3.1: Setting up Basic Server
First, create a simple Flask server to ensure everything is working correctly. Inside app.py add these initial imports and basic route:
Save the file and run the below command in your terminal:
Open your browser or use `curl` to test the endpoint:
You should see the message "Voice Meeting Summarizer is running!"
Step 3.2: Configure Dependencies and Initialize Clients
Next, you'll set up our main dependencies and initialize the clients needed for the voice meeting summarizer. You'll configure OpenAI for transcription and summary generation, Twilio for voice handling, and SendGrid for email delivery. Here's the complete code for the app.py:
app.py
Step 3.3: Setting Up Conference Routes
Now, you'll add two routes to handle conference functionality: one for joining conferences and another for monitoring conference status. Add this code to app.py, after your @app.route('/') block, but before the line if __name__ == '__main__'::
Step 3.4: Handling Recording Processing and Email Delivery
This step sets up the recording callback route that processes the conference recording, transcribes it, generates transcript & summary and delivers the formatted summary to participants via email. This is added under the code you just added in app.py.
With this implementation, when two people call your Twilio number, the conference and recording starts, once the conference call finishes, the system:
- Downloads the conference recording
- Processes conference recordings using OpenAI's Whisper and GPT-4
- Creates a styled email template
- Sends summary and transcript to configured recipients
Testing the Application
To test your Voice Meeting Summarizer, you'll need to deploy it and configure your Twilio webhook. This guide covers deployment using the Render cloud platform. For more information regarding render you can check out this guide. Alternatively, if you prefer local testing, you can use ngrok by following this guide.
- Go to Render website and sign in to your account
- Click "New +" button in the top right and select "Web Services"
- Under the Source Code section you will have the option to connect your git provider to deploy from your existing repositories or enter a public git repository's URL. Once you’ve made your selection, click connect. Choose "Build and deploy from a Git repository"
- Connect your GitHub account and select your voice-meeting-summarizerrepository
- Enter a unique name for your web service, such as voice-meeting-summarizer. This will inform the URL that Render will create for you.
- For the Start Command, you'll see gunicorn app:appas the default. Change this topython app.py. You're usingpython app.pyinstead of gunicorn because your Flask app is configured to run directly with the built-in development server, which is simpler for this tutorial and includes the port configuration you set up in your code.
- Click "Create Web Service".
- After your web service is created you will see the publicly accessible URL below at the top of the page. Copy the URL and navigate to Manage > Environment on the left navigation pane. Here you can assign the BASE_URL environment variable to that value and the other variables that were defined earlier, for example, OPENAI_API_KEY.
- Click "Save Changes" - this will trigger a redeploy.
The below image shows what your Render dashboard should look like after you have successfully deployed your application and the server has started running. You should see "Running python app.py" in the logs. If you're using ngrok for local development instead, you can run python app.py locally to verify your server is running.
 
			
		 
			
		Open your Twilio console, go to the Develop > Phone Numbers > Manage > Active Numbers > Configure. Set the A Call Comes In webhook URL  to your  Render URL or ngrok URL, then append /join-conference to the URL and click on Save. In my own case, I will set the URL to https://voice-meeting-summarizer.onrender.com/join-conference
 
			
		 
			
		- Start a test conference:
- Call your Twilio number from one phone.
- Join from another phone (you'll hear a beep when participants join). You should start to see the logs in the server when the conference starts.
 
			
		 
			
		- Have a short conversation - Once the conversation starts, you should see logs starting to appear in the Render console or in your local server logs.
 
			
		 
			
		- End the call - You should see the summary,transcriptand the logs thatEmail sent successfullyto 2 recipients, since I set myRECIPIENT_EMAILin my.envto be to 2 emails.
 
			
		 
			
		- Within few minutes, you should receive an email containing:
- A summary of your conversation during the conference call.
- The complete transcript.
- Recording ID for reference.
 
			
		 
			
		Conclusion
Congratulations! You've built an AI-powered meeting summarizer that combines Twilio's voice capabilities with OpenAI's advanced language models. Your application can now:
- Host conference calls with multiple participants.
- Record conversations automatically.
- Transcribe meetings using Whisper.
- Generate concise summaries using GPT-4.
- Deliver formatted summaries and transcript through email.
If you want to view the full source code for this app, check it out on GitHub.
Happy Coding!
Taofiq is a full-stack software engineer specializing in AI-powered applications and automation tools. He focuses on creating practical solutions that help teams work more efficiently. Connect with him on Github.
Related Posts
Related Resources
Twilio Docs
From APIs to SDKs to sample apps
API reference documentation, SDKs, helper libraries, quickstarts, and tutorials for your language and platform.
Resource Center
The latest ebooks, industry reports, and webinars
Learn from customer engagement experts to improve your own communication.
Ahoy
Twilio's developer community hub
Best practices, code samples, and inspiration to build communications and digital engagement experiences.
 
     
    