Build a daily Notion research paper library from arXiv

Every weekday at 8am, find the latest arXiv papers in your areas, skip ones you've already saved, and log fresh summaries to your Notion database.

Agentic Task
arXiv, Notion, Personal Productivity, Operations, Research & Monitoring, Daily Digests

Every weekday at 8am ET, build out my personal Notion research paper library from arXiv. This is an agent workflow on a cron trigger.

I will configure: a list of arXiv search topics or keywords, one or more arXiv subject categories (for example cs.AI and cs.LG), a short focus statement describing what I care about (for example "agent evaluation and benchmark design"), and the Notion database where my research library lives.

Step 1. Use arXiv's Search Papers operation to pull the latest preprints matching my configured topics, keywords, and categories. Sort by submitted date descending and pull a generous candidate window (for example the last 40 to 60 results across my queries) so there's enough to rank against.

Step 2. For each candidate paper, use Notion's Query a Data Source operation against my research database to check whether the arXiv ID is already logged. Filter on the arXiv ID property. Skip any paper that already exists, regardless of who created the row.

Step 3. For each genuinely new paper, read the abstract and produce a tight plain-English summary of three to five sentences, the key contribution in one sentence, a relevance score from 1 to 10 against my focus statement, and 3 to 5 topic tags.

Step 4. Rank the new papers by relevance score and keep only the top 10 for this run, so the database stays curated rather than flooded.

Step 5. For each of the top 10, call Notion's Create a Page operation to add a row to my research database with these properties: Title, Authors, arXiv ID, Abstract URL, PDF URL, Published Date, Summary, Key Contribution, Relevance Score, and Tags.

Step 6. End the run by logging a one line activity note: how many candidates were considered, how many were already in Notion, and how many new pages were created.
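As an illustration only, the ranking cap in Step 4 and the activity note in Step 6 could be sketched in Python as below. The `Paper` class, `select_top`, and `activity_note` are hypothetical names chosen for this sketch, not part of the workflow, and the note's wording is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Paper:
    """Hypothetical record for one new paper after Step 3."""
    arxiv_id: str
    title: str
    relevance: int  # 1 to 10, scored against the focus statement


def select_top(new_papers, cap=10):
    """Step 4: keep the `cap` highest-scoring papers.

    sorted() is stable, so papers with equal scores keep their
    original (newest-first) order.
    """
    return sorted(new_papers, key=lambda p: p.relevance, reverse=True)[:cap]


def activity_note(considered, already_logged, created):
    """Step 6: one-line summary of the run (wording is illustrative)."""
    return (
        f"{considered} candidates considered, "
        f"{already_logged} already in Notion, "
        f"{created} new pages created"
    )
```

Raising or lowering `cap` corresponds to changing the per-run paper limit.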

Notes for the author. The Notion database uses the new data source semantics, so prefer Query a Data Source over the deprecated Query a Database. arXiv's API is public and needs no authentication, but it is rate limited to roughly one request every 3 seconds, so space requests out. The Notion connection must be shared with the research database in Notion's UI before the workflow runs; otherwise Notion will not return any data for it.
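For readers curious what sits underneath Steps 1 and 2, here is a rough Python sketch. It assumes the public arXiv query endpoint and its `search_query`, `sortBy`, `sortOrder`, and `max_results` parameters, and it assumes the arXiv ID column is a Notion rich text property named "arXiv ID". The helper names are hypothetical, and whether the Search Papers and Query a Data Source operations accept input in exactly this form is also an assumption.

```python
import time
from urllib.parse import urlencode

ARXIV_ENDPOINT = "https://export.arxiv.org/api/query"


def arxiv_query_url(categories, keywords, max_results=50):
    """Build a newest-first arXiv API query for the given categories and keywords."""
    clauses = []
    if categories:
        clauses.append("(" + " OR ".join(f"cat:{c}" for c in categories) + ")")
    if keywords:
        clauses.append("(" + " OR ".join(f'all:"{k}"' for k in keywords) + ")")
    params = {
        "search_query": " AND ".join(clauses),
        "sortBy": "submittedDate",
        "sortOrder": "descending",
        "start": 0,
        "max_results": max_results,
    }
    return f"{ARXIV_ENDPOINT}?{urlencode(params)}"


def notion_id_filter(arxiv_id, property_name="arXiv ID"):
    """Query body that matches rows whose rich text property equals the ID.

    Assumes a rich text property; a title or URL property would need a
    different filter key.
    """
    return {
        "filter": {"property": property_name, "rich_text": {"equals": arxiv_id}},
        "page_size": 1,
    }


def paced(items, delay_seconds=3.0, sleep=time.sleep):
    """Yield items with a pause between them to respect arXiv's rate limit."""
    for index, item in enumerate(items):
        if index:
            sleep(delay_seconds)
        yield item
```

Wrapping the list of query URLs in `paced(...)` before fetching keeps requests about 3 seconds apart; the `sleep` argument exists only so the pause can be swapped out in tests.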

Additional information

What does this prompt do?
  • Pulls the newest preprints in the topics and arXiv categories you care about, every weekday morning.
  • Checks your Notion research database first so you never see the same paper twice.
  • Writes a tight plain English summary, the key contribution, a relevance score, and 3 to 5 topic tags into each new row.
  • Caps each run at the top 10 papers by relevance so your library stays curated, not flooded.
What do I need to use this?
  • A Notion workspace with a research database that has properties for title, authors, arXiv ID, abstract URL, PDF URL, published date, summary, key contribution, relevance score, and tags.
  • The database shared with your General Input Notion connection (Notion will not return data otherwise).
  • The topics, keywords, or arXiv subject categories you want to follow (for example cs.AI plus a short focus statement on agent evaluation).
How can I customize it?
  • Change the schedule, for example once a day, weekdays only, or twice a week.
  • Adjust the categories, keywords, or focus statement that drives what counts as relevant.
  • Raise or lower the daily paper cap to control how fast your library grows.

Frequently asked questions

Will it create duplicate rows if a paper shows up under two of my categories?
No. Before adding anything new, the workflow looks up each paper's arXiv ID in your Notion database and skips ones that are already logged.
What if I add a paper manually before the workflow runs?
It will be skipped on the next run because the dedup step matches on the arXiv ID property, regardless of who created the row.
Can I follow specific authors instead of just topics?
Yes. The search step accepts authors as part of your configured queries, so you can mix topic keywords, categories, and author names.
Does it download the PDFs?
No. It logs the abstract URL and PDF URL as links in the Notion row so you can open them on demand, but it does not store the file itself.
What happens on days when there are no new papers?
The workflow runs cleanly and logs a short note saying zero new papers were added. Nothing is written to Notion.
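The duplicate questions above all come down to matching on the arXiv ID. A minimal Python sketch of that idea follows, under two assumptions that the workflow description does not confirm: that IDs are compared without their version suffix (so `2401.00001v2` matches `2401.00001`), and that a paper returned by two queries in the same run is collapsed before anything is written. The function names are hypothetical.

```python
import re


def normalize_arxiv_id(raw):
    """Reduce an arXiv URL or versioned ID to its bare identifier."""
    ident = raw.strip().rsplit("/abs/", 1)[-1]  # drop any URL prefix
    return re.sub(r"v\d+$", "", ident)          # drop a trailing version like v2


def split_new(candidate_ids, logged_ids):
    """Return (new_ids, skipped_count) for one run.

    A candidate is skipped if its normalized ID is already logged or has
    already been seen earlier in the same run.
    """
    seen = {normalize_arxiv_id(i) for i in logged_ids}
    new_ids = []
    skipped = 0
    for raw in candidate_ids:
        ident = normalize_arxiv_id(raw)
        if ident in seen:
            skipped += 1
        else:
            seen.add(ident)
            new_ids.append(ident)
    return new_ids, skipped
```

On a day with no new papers, `split_new` returns an empty list and nothing would be written.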

Stop hand-curating your reading list.

Connect Notion once and let Geni stock your research database every weekday morning.