File size: 1,534 Bytes
64e0bc7
0afbcef
 
64e0bc7
0afbcef
64e0bc7
0afbcef
64e0bc7
b0ecb2b
64e0bc7
b0ecb2b
 
 
 
 
 
 
 
 
 
 
 
 
c4c6dff
 
 
64e0bc7
 
0afbcef
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
---
title: Research Papers Federated Search
emoji: 📚
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 5.50.0
app_file: app.py
pinned: true
license: mit
mcp_server: true
tags:
- research
- papers
- academic-search
- federated-search
- mcp
- mcp-server
- arxiv
- pubmed
- semantic-scholar
- openalex
- scholarly
datasets:
- barissozudogru/openalex-concepts
- barissozudogru/federated-search-traces
---

# Research Papers Federated Search

## What this does

Federated search across four academic sources in a single call:

- **arXiv** -- Computer science, physics, math, quantitative biology
- **PubMed** -- Biomedical literature via NCBI E-utilities
- **Semantic Scholar** -- Cross-disciplinary, includes citation counts
- **OpenAlex** -- ~250M works with canonical IDs and concept tags

Results are deduplicated by DOI when available, then by `(source, source_id)` so the same arXiv preprint surfaced through multiple sources collapses to one row.

## Cold start

This Space sleeps when idle. First query after a quiet period takes 30-60 seconds for the container to wake up. Subsequent queries are fast.

## The full server

This Space demonstrates the core federated search. The complete MCP server has citation graphs, BibTeX export, persistent SQLite cache, topic detection with embeddings, and trend analysis:

```
pip install git+https://github.com/barissozudogru/deep-research-digest.git
```

## Related

- Source code: <https://github.com/barissozudogru/deep-research-digest>
- Maintainer: <https://bsozudogru.com>