Join us in transforming healthcare in Africa through Artificial Intelligence!
Watch the tutorials below on how to use ChatGPT for Data Analysis and use our Cheat Sheets for guidance on prompting.
Contact us for opportunities to partner in using AI tools to improve the accuracy and efficiency of data analysis:
Unlocking the Power of Data to Improve Health
Healthcare systems in low- and middle-income countries (LMICs) struggle to make sense of vast amounts of data due to scarcity of analytical skills. By using new Artificial Intelligence (AI) technology like ChatGPT, we aim to bridge this data interpretation gap, thereby supporting decision-making and strengthening health care systems.
We used ChatGPT-4 to analyse and interpret routine health programme data, comparing AI-generated outputs with traditional analysis and statistical software. Working closely with the South African Department of Health, we developed an AI-supported data-analysis ‘cheat sheet’ (available to download below) to scale-up this skill.
Embracing ChatGPT as a tool to assist with complex data analysis is a big step forward in improving how data is understood and utilised, especially in LMICs. The AI-toolkit can be adapted and used for any data interpretation in any sector. While individuals using the toolkit must have an in-depth understanding of the programme, they do not need to be specialists in data analytics.
ChatGPT Data Analysis Toolkit
Tutorial Videos on how to setup and use ChatGPT for Data Analysis
Tutorial 1: Setup
Background and Basic Setup of ChatGPT
Tutorial 2: Upload
Preparing and Uploading Data to ChatGPT
Tutorial 3: Cleaning
Cleaning and Downloading Data with ChatGPT
Tutorial 4: Basic Analysis
Basic Data Analysis and Visualization with ChatGPT
Tutorial 5: Insights
Data Insights with ChatGPT
Tutorial 6: Performance Insights
Programme Performance Insights from ChatGPT
Tutorial 7: Strategic Insighs
Strategic Insights from ChatGPT
Tutorial 8: Ethics and AI
Ethics and Limitations of AI
SOP and Prompt Sheets for Data Analysis with ChatGPT
Use these guides to set up and use ChatGPT-4 for data analysis purposes:
Hardware and software requirements:
- A functional computer, either laptop or desktop.
- A reliable and uninterrupted internet connection
- A web browser compatible with the ChatGPT platform.
Subscription to ChatGPT-4:
- Using a credit card, secure a monthly subscription to the latest version – currently ChatGPT-4 for a monthly fee of 20USD.
- To subscribe, visit https://chat.openai.com
Small data sets or simple tables:
- Directly drag and drop the dataset into ChatGPT chat window.
Large data sets:
- For large data sets or multiple files, use the Attachment icon situated within the chat interface.
- The system allows for the simultaneous upload of multiple datasets or multiple workbooks within a single dataset.
Supported file formats:
- ChatGPT-4 supports a range of data types, including but not limited to CSV, Excel, PDF and dta.
Using the prompt sheets:
Refer to the provided prompt sheets in the toolkit for specific data analysis instructions. This sheet is curated to offer guidance tailored to ChatGPT-4’s capabilities.
Data description and cleaning:
Utilize the prompts specified for data description, sorting and grouping.
Data analysis:
The prompt sheet offers tailored prompts for both patient-level and cumulative dataset analysis.
Merging datasets:
Refer to the prompt sheet for instructions on merging various data sets.
Data visualization:
Utilize the visualization prompts from the prompt sheet to generate insightful charts and graphs.
Generating reports and insights:
The prompt sheet provides prompts for generating analysis insights, comprehensive reports, summaries, and presentation materials.
NOTES:
-
Ensure that sensitive data is anonymized or encrypted before uploading for analysis. ChatGPT-4 respects user privacy, but it’s always a good practice to maintain data confidentiality.
-
Regularly check the OpenAI platform for updates or changes in features and functionalities.
- Calculating Proportions
- Sorting Data
- Presenting Data
The prompts and examples below are based on a DHIS2 Data Export for the TB control programme.
Be sure to use the indicators and descriptions as they appear in your data set.
- Data Description
- Trends
- Data Interpretation
- Programme Performance
- Insights into Predictive Analytics
The prompts and examples below are based on a DHIS Data Export for DS-TB.
Be sure to use the indicators and descriptions as they appear in your data set.
The prompts and examples below are based on a DHIS Data Export for DS-TB.
Be sure to use the indicators and descriptions as they appear in your data set.
- Calculating Proportions
- Sorting Data
- Presenting Data
The prompts and examples below are based on an EDR Web Data Export.
Be sure to use the indicators and descriptions as they appear in your data set.
PLEASE NOTE: When using Patient level data, it is critical that Patient Identifying Information (e.g. Names) are removed and the data is anonymized.
- Data Description
- Data Trends
- Data Interpretation
- Programme Performance
- Insights into Predictive Analytics
- Advanced Data Analysis
- Key findings
The prompts and examples below are based on an EDR Web Data Export.
Be sure to use the indicators and descriptions as they appear in your data set.
PLEASE NOTE: When using Patient level data, it is critical that Patient Identifying Information (e.g. Names) are removed and the data is anonymized.
Be sure to use the indicators and descriptions as they appear in your data set.
PLEASE NOTE: When using Patient level data, it is critical that Patient Identifying Information (e.g. Names) are removed and the data is anonymized.