Automated Academic Paper Metadata & Variable Extraction with Gemini to Google Sheets
In the social and behavioral sciences (e.g., psychology, sociology, economics, management), researchers and students often need to normalize academic paper metadata and extract variables before any literature review or meta-analysis.
This workflow automates the busywork. Using an LLM, it processes CSV/XLSX/XLS files (exported from WoS, Scopus, EndNote, Zotero, or your own spreadsheets) into normalized metadata and extracted variables, and writes a neat table to Google Sheets.
Example Google Sheet: click me
Who is this for?
Undergraduate and graduate students or researchers in soft-science fields (psychology, sociology, economics, business)
People who don't have time to read full papers and need quick overviews
Anyone who wants to automate academic paper metadata normalization and variable extraction to speed up a literature review
How it works
Upload an academic paper file (CSV/XLSX/XLS) in chat.
The workflow creates a Google Sheets spreadsheet with two tabs: Checkpoint and FinalResult.
A structured-output LLM normalizes core metadata (title, abstract, authors, publication date, source) from the uploaded file and writes it to Checkpoint. A Gmail notification is sent when this step finishes.
A second structured-output LLM uses that metadata to extract variables (Independent Variable, Dependent Variable) and writes them to FinalResult. You'll get a second Gmail notification when it is done.
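To make the two structured-output steps more concrete, here is a minimal Python sketch of what the parsed results might look like before they are written to the sheet. The workflow's actual Structured Output Parser schema is not shown on this page, so the class names, field names, and the `parse_metadata` and `to_sheet_row` helpers are all illustrative assumptions.

```python
import json
from dataclasses import dataclass, field, fields


@dataclass
class PaperMetadata:
    # Hypothetical field names mirroring the metadata listed above.
    title: str
    abstract: str
    authors: list[str]
    publication_date: str
    source: str


@dataclass
class ExtractedVariables:
    # Hypothetical field names for the second LLM step.
    independent_variables: list[str] = field(default_factory=list)
    dependent_variables: list[str] = field(default_factory=list)


def parse_metadata(llm_json: str) -> PaperMetadata:
    """Parse a JSON reply and fail loudly if an expected key is missing."""
    data = json.loads(llm_json)
    expected = [f.name for f in fields(PaperMetadata)]
    missing = [name for name in expected if name not in data]
    if missing:
        raise ValueError(f"missing keys: {missing}")
    return PaperMetadata(**{name: data[name] for name in expected})


def to_sheet_row(meta: PaperMetadata, variables: ExtractedVariables) -> list[str]:
    """Flatten both results into one spreadsheet row (lists joined with '; ')."""
    return [
        meta.title,
        meta.abstract,
        "; ".join(meta.authors),
        meta.publication_date,
        meta.source,
        "; ".join(variables.independent_variables),
        "; ".join(variables.dependent_variables),
    ]
```

Rejecting replies with missing keys is one possible design choice: it keeps incomplete rows out of the sheet, at the cost of needing a retry or a manual fix for that paper.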
How to set up
Credentials
Google Sheets OAuth2 (read/write)
Gmail OAuth2 (send notifications)
Google Gemini (or any LLM you prefer)
Quick start
Connect Google Sheets, Gmail, and Gemini (or your LLM) credentials.
Open File Upload Trigger, then upload your CSV/XLSX/XLS file and type a name in chat (used as the Google Sheets spreadsheet title).
Watch your inbox for status emails and open the Google Sheets spreadsheet to review Checkpoint and FinalResult.
Customization
Journal lists: Edit the Journal Rank Classifier code node to add/remove titles. The default list is for business/management journals; swap it for a list from your own field.
Notifications: Replace Gmail with Slack, Teams, or any channel you prefer.
LLM outputs: Need different metadata or extracted data? Edit the LLM's system prompt and Structured Output Parser.
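As one way to picture the journal-list customization, here is a minimal Python sketch of a title lookup. The code inside the actual Journal Rank Classifier node is not shown on this page, so `JOURNAL_LIST`, `normalize_title`, `classify_journal`, the `listed`/`unlisted` labels, and the placeholder titles are all illustrative assumptions.

```python
# Hypothetical journal list: replace these placeholder titles with the
# journals that matter in your own field.
JOURNAL_LIST = {
    "Example Journal of Management",
    "Sample Review of Business Studies",
}


def normalize_title(title: str) -> str:
    """Lowercase the title and collapse repeated whitespace."""
    return " ".join(title.lower().split())


# Normalize the list once so lookups ignore case and spacing differences.
_NORMALIZED = {normalize_title(title) for title in JOURNAL_LIST}


def classify_journal(source: str) -> str:
    """Return 'listed' if the paper's source matches a list entry."""
    return "listed" if normalize_title(source) in _NORMALIZED else "unlisted"
```

For example, `classify_journal("  example journal of  MANAGEMENT ")` matches the first placeholder entry despite the different case and spacing. Exact matching after normalization is a deliberately simple choice; abbreviated journal names would need their own entries in the list.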
Note
Make sure your file includes abstracts. If the academic paper data you upload doesn't contain an abstract, the extracted results will be far less useful.
CSV yields no items? Encoding mismatches can break the workflow. If this happens, convert the CSV to .xls or .xlsx and try again.
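Both notes can be checked locally before uploading. The sketch below is not part of the workflow: it tries a few common encodings on the raw CSV bytes and then lists the rows with an empty abstract. The encoding order, the function names, and the `Abstract` column header are assumptions, so adjust them to match your export.

```python
import csv
import io

# Assumed candidates, tried in order; latin-1 accepts any byte sequence.
CANDIDATE_ENCODINGS = ("utf-8-sig", "cp1252", "latin-1")


def decode_csv_bytes(raw: bytes) -> tuple[str, str]:
    """Return the decoded text and the first encoding that worked."""
    for encoding in CANDIDATE_ENCODINGS:
        try:
            return raw.decode(encoding), encoding
        except UnicodeDecodeError:
            continue
    # Only reachable if latin-1 is removed from the candidate list.
    raise ValueError("none of the candidate encodings could decode the file")


def rows_missing_abstract(text: str, column: str = "Abstract") -> list[int]:
    """Return 1-based data-row numbers whose abstract cell is empty."""
    reader = csv.DictReader(io.StringIO(text))
    if column not in (reader.fieldnames or []):
        raise ValueError(f"no column named {column!r} in the header row")
    return [
        number
        for number, row in enumerate(reader, start=1)
        if not (row.get(column) or "").strip()
    ]


# Usage with a hypothetical file name:
# with open("export.csv", "rb") as handle:
#     text, encoding = decode_csv_bytes(handle.read())
# print(encoding, rows_missing_abstract(text))
```

If the detected encoding is not UTF-8, that may explain an empty result in the workflow, and converting the file to .xls or .xlsx as suggested above sidesteps the issue.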
Help
Contact: [email protected]