Spaces:
Running
Running
Upload 2 files
Browse files- README.md +501 -117
- index.html +831 -389
README.md
CHANGED
|
@@ -1,179 +1,563 @@
|
|
| 1 |
---
|
| 2 |
-
title:
|
| 3 |
-
emoji:
|
| 4 |
-
colorFrom:
|
| 5 |
-
colorTo:
|
| 6 |
sdk: static
|
| 7 |
-
pinned:
|
| 8 |
license: mit
|
| 9 |
---
|
| 10 |
|
| 11 |
-
#
|
| 12 |
|
| 13 |
-
A
|
| 14 |
|
| 15 |
## Summary
|
| 16 |
|
| 17 |
-
|
| 18 |
|
| 19 |
-
|
| 20 |
|
| 21 |
-
##
|
| 22 |
|
| 23 |
-
|
| 24 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 25 |
|
| 26 |
-
###
|
| 27 |
-
- **Universal Process Analysis:** Domain-agnostic methodology applicable across biology, chemistry, software engineering, business processes, and more
|
| 28 |
-
- **LLM-Powered Extraction:** Automated extraction of process steps, decision points, and logic flows using Google Gemini 2.0 Flash
|
| 29 |
-
- **Structured Visualization:** Mermaid.js-based flowchart generation encoded as JSON for programmatic access and integration
|
| 30 |
-
- **Iterative Refinement:** Systematic approach enabling continuous improvement through visualization and LLM-assisted refinement
|
| 31 |
|
| 32 |
-
|
| 33 |
-
-
|
| 34 |
-
-
|
| 35 |
-
-
|
| 36 |
-
-
|
|
|
|
|
|
|
| 37 |
|
| 38 |
-
###
|
| 39 |
-
The Programming Framework serves as the **foundational meta-tool** of the CopernicusAI Knowledge Engine, providing the underlying methodology that enables specialized applications:
|
| 40 |
|
| 41 |
-
|
| 42 |
-
- **
|
| 43 |
-
- **
|
| 44 |
-
- **
|
| 45 |
-
- **
|
|
|
|
| 46 |
|
| 47 |
-
|
| 48 |
|
| 49 |
-
##
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 50 |
|
| 51 |
-
|
| 52 |
|
| 53 |
-
##
|
| 54 |
|
| 55 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 56 |
|
| 57 |
-
|
|
|
|
|
|
|
|
|
|
| 58 |
|
| 59 |
-
|
| 60 |
|
| 61 |
-
|
| 62 |
-
2. **LLM Analysis** - AI extracts steps, decisions, branches, and logic flow
|
| 63 |
-
3. **Generate Flowchart** - Create Mermaid diagram encoded as JSON structure
|
| 64 |
-
4. **Visualize & Iterate** - Interactive flowchart reveals insights and enables refinement
|
| 65 |
|
| 66 |
-
|
|
|
|
|
|
|
|
|
|
| 67 |
|
| 68 |
-
|
| 69 |
-
Works across any field: biology, chemistry, software engineering, business processes, legal workflows, manufacturing, and beyond.
|
| 70 |
|
| 71 |
-
|
| 72 |
-
Start with rough analysis, visualize, identify gaps, refine with LLM, repeat until the process logic is crystal clear.
|
| 73 |
|
| 74 |
-
###
|
| 75 |
-
JSON storage enables programmatic access, version control, cross-referencing, and integration with other tools and databases.
|
| 76 |
|
| 77 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 78 |
|
| 79 |
-
###
|
| 80 |
-
First specialized application: visualizing biochemical processes like DNA replication, metabolic pathways, and cell signaling.
|
| 81 |
-
- [Explore GLMP β](https://huggingface.co/spaces/garywelz/glmp)
|
| 82 |
|
| 83 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 84 |
|
| 85 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 86 |
|
| 87 |
-
|
| 88 |
-
-
|
| 89 |
-
-
|
| 90 |
-
-
|
|
|
|
| 91 |
|
| 92 |
-
|
| 93 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 94 |
|
| 95 |
-
|
| 96 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 97 |
|
| 98 |
-
###
|
| 99 |
-
- [Physics Database Table](https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/physics-processes-database/physics-database-table.html) - Interactive database with 21 processes across 7 subcategories
|
| 100 |
|
| 101 |
-
|
| 102 |
-
|
|
|
|
| 103 |
|
| 104 |
-
##
|
| 105 |
|
| 106 |
-
|
| 107 |
-
- Google Gemini 2.0 Flash for analysis
|
| 108 |
-
- Vertex AI for enterprise deployment
|
| 109 |
-
- Custom prompts for process extraction
|
| 110 |
-
- Structured JSON output formatting
|
| 111 |
|
| 112 |
-
###
|
| 113 |
-
-
|
| 114 |
-
-
|
| 115 |
-
-
|
| 116 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 117 |
|
| 118 |
-
|
| 119 |
-
|
| 120 |
-
|
| 121 |
-
-
|
| 122 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 123 |
|
| 124 |
-
|
| 125 |
-
- GLMP specialized collections
|
| 126 |
-
- CopernicusAI knowledge graph
|
| 127 |
-
- Research papers database
|
| 128 |
-
- API endpoints for programmatic access
|
| 129 |
|
| 130 |
-
|
| 131 |
|
| 132 |
-
|
| 133 |
-
Hugging Face Spaces. https://huggingface.co/spaces/garywelz/programming_framework
|
| 134 |
|
| 135 |
-
|
| 136 |
-
|
|
|
|
|
|
|
| 137 |
|
| 138 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 139 |
|
| 140 |
-
|
| 141 |
|
| 142 |
-
|
| 143 |
|
| 144 |
-
|
| 145 |
-
First specialized application of the Programming Framework to biochemical processes. 100+ biological pathways visualized.
|
| 146 |
-
- [Visit GLMP β](https://huggingface.co/spaces/garywelz/glmp)
|
| 147 |
|
| 148 |
-
###
|
| 149 |
-
|
| 150 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 151 |
|
| 152 |
-
|
| 153 |
|
| 154 |
-
|
| 155 |
-
- Scientific Method
|
| 156 |
-
- Software Deployment Pipeline
|
| 157 |
-
- Customer Support Workflow
|
| 158 |
-
- Research Paper Publication
|
| 159 |
|
| 160 |
-
|
|
|
|
| 161 |
|
| 162 |
-
|
| 163 |
|
| 164 |
-
|
| 165 |
-
- **Visualization**: Mermaid.js
|
| 166 |
-
- **Storage**: Google Cloud Storage, Firestore
|
| 167 |
-
- **Format**: JSON with Mermaid syntax
|
| 168 |
-
- **Frontend**: Static HTML + Tailwind CSS
|
| 169 |
|
| 170 |
-
|
| 171 |
|
| 172 |
-
|
|
|
|
|
|
|
|
|
|
| 173 |
|
| 174 |
---
|
| 175 |
|
| 176 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 177 |
|
| 178 |
-
|
| 179 |
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
title: CopernicusAI - Research-Driven Podcast Generation Platform
|
| 3 |
+
emoji: π¬
|
| 4 |
+
colorFrom: purple
|
| 5 |
+
colorTo: blue
|
| 6 |
sdk: static
|
| 7 |
+
pinned: false
|
| 8 |
license: mit
|
| 9 |
---
|
| 10 |
|
| 11 |
+
# π¬ CopernicusAI - Knowledge Engine for Scientific Discovery
|
| 12 |
|
| 13 |
+
A collaborative research platform that transforms cutting-edge scientific research into accessible, multi-format tools for collective knowledge exploration. These are research instrumentsβlike microscopes for observing the collective knowledge of humanityβenabling hypothesis formation, testing, and discovery across scientific disciplines.
|
| 14 |
|
| 15 |
## Summary
|
| 16 |
|
| 17 |
+
**CopernicusAI** is an operational research platform that synthesizes scientific literature from 250+ million papers into AI-generated podcasts, integrates with a knowledge graph of 12,000+ indexed papers, and provides collaborative tools for research discovery. The system demonstrates production-ready multi-source research synthesis with full citation tracking and evidence-based content generation requiring minimum 3 research sources per episode.
|
| 18 |
|
| 19 |
+
The platform includes a fully operational Knowledge Engine Dashboard (deployed December 2025) with interactive knowledge graph visualization, vector search, and RAG capabilities, enabling researchers to explore, query, and synthesize scientific knowledge across disciplines.
|
| 20 |
|
| 21 |
+
## Prior Work: CopernicusAI Research Interface
|
| 22 |
|
| 23 |
+
CopernicusAI is an active research prototype exploring AI-generated audio briefings as an interface for assisted scientific research.
|
| 24 |
+
|
| 25 |
+
The system allows any user to generate, refine, and share AI-generated science podcasts based on structured prompts, enabling rapid orientation to a topic, iterative deepening, and personalized research briefings.
|
| 26 |
+
|
| 27 |
+
Rather than functioning as a static content platform, CopernicusAI supports collectively generated and shared research artifacts, analogous to community-driven knowledge platforms (e.g., discussion forums), but grounded in scientific sources and metadata-aware workflows.
|
| 28 |
+
|
| 29 |
+
This work demonstrates technical feasibility for:
|
| 30 |
+
- AI-assisted research briefing and orientation
|
| 31 |
+
- Iterative question refinement via conversational interfaces
|
| 32 |
+
- Integration of text, audio, and metadata in research workflows
|
| 33 |
+
|
| 34 |
+
### Current Implementation (December 2025)
|
| 35 |
+
|
| 36 |
+
The Knowledge Engine Dashboard is **fully operational** and deployed to Google Cloud Run, providing unified access to all components with interactive knowledge graph visualization, vector search, RAG queries, and content browsing.
|
| 37 |
+
|
| 38 |
+
## π― Mission & Vision
|
| 39 |
+
|
| 40 |
+
Inspired by Nicolaus Copernicus who challenged accepted knowledge with evidence and rigorous analysis, **CopernicusAI** creates collaborative research tools that enable collective participation in scientific discovery. These platforms are instruments for exploring humanity's collective knowledgeβtools for hypothesis formation, testing, and collaborative research, not just educational content.
|
| 41 |
+
|
| 42 |
+
Just as a microscope enables observation of the microscopic world, CopernicusAI tools enable observation and exploration of humanity's collective knowledge. Subscribers collaborate to prompt, generate, and refine research contentβsharing discoveries publicly or keeping them private. As large language models (LLMs) and AI systems gain unprecedented knowledge, CopernicusAI provides the infrastructure for human-AI collaborative knowledge exploration, with evidence-based truth-seeking as our guiding principle.
|
| 43 |
+
|
| 44 |
+
---
|
| 45 |
+
|
| 46 |
+
## π Core Platform Capabilities
|
| 47 |
+
|
| 48 |
+
### ποΈ AI-Powered Podcast Generation
|
| 49 |
+
|
| 50 |
+
**Production-Ready System:**
|
| 51 |
+
- Collaborative platform where subscribers prompt and generate multi-voice AI podcasts (5-10 minutes) synthesizing research from multiple academic sources
|
| 52 |
+
- Subscribers can share their podcasts publicly or keep them private
|
| 53 |
+
- Evidence-based content generation requiring minimum 3 research sources per episode
|
| 54 |
+
- Comprehensive research integration across 8+ academic databases
|
| 55 |
+
- **64 episodes** generated across Biology, Chemistry, Computer Science, Mathematics, and Physics
|
| 56 |
+
- Automated audio synthesis with professional multi-speaker dialogue
|
| 57 |
+
- AI-generated episode thumbnails with scientific visualizations
|
| 58 |
+
- RSS feed distribution compatible with Spotify, Apple Podcasts, Google Podcasts
|
| 59 |
+
|
| 60 |
+
**Research Integration:**
|
| 61 |
+
- Real-time discovery from PubMed, arXiv, NASA ADS, Zenodo, bioRxiv, CORE, Google Scholar, and News APIs
|
| 62 |
+
- Parallel search across multiple databases for comprehensive coverage
|
| 63 |
+
- Quality scoring and relevance ranking of research sources
|
| 64 |
+
- Paradigm shift identification and interdisciplinary connection analysis
|
| 65 |
+
- Automatic citation extraction and formatting
|
| 66 |
+
- Source validation and authenticity verification
|
| 67 |
+
|
| 68 |
+
### π€ Advanced LLM Integration
|
| 69 |
+
|
| 70 |
+
**Multi-Model Architecture:**
|
| 71 |
+
- **Google Gemini 3** - Latest research analysis and content generation
|
| 72 |
+
- **OpenAI GPT-4/GPT-3.5** - Content synthesis and quality validation
|
| 73 |
+
- **Anthropic Claude 3** (Sonnet, Haiku via OpenRouter) - Alternative reasoning paths
|
| 74 |
+
- **ElevenLabs TTS** - Multi-voice text-to-speech synthesis
|
| 75 |
+
- Model selection based on task complexity and expertise level
|
| 76 |
+
- Fallback chains for reliability and cost optimization
|
| 77 |
+
|
| 78 |
+
**Capabilities:**
|
| 79 |
+
- Multi-paper analysis and synthesis
|
| 80 |
+
- Paradigm shift detection in research domains
|
| 81 |
+
- Interdisciplinary connection identification
|
| 82 |
+
- Entity extraction (genes, proteins, chemical compounds, mathematical concepts)
|
| 83 |
+
- Citation tracking and cross-reference analysis
|
| 84 |
+
- Content quality scoring and validation
|
| 85 |
+
|
| 86 |
+
### π Research Resource Access
|
| 87 |
+
|
| 88 |
+
**Comprehensive Academic Database Coverage:**
|
| 89 |
+
|
| 90 |
+
Our research pipeline integrates with **8+ major academic databases**, providing access to:
|
| 91 |
+
|
| 92 |
+
- **PubMed/NCBI** (~30+ million biomedical papers)
|
| 93 |
+
- **arXiv** (~2+ million preprints in physics, mathematics, CS, quantitative biology)
|
| 94 |
+
- **NASA ADS** (~15+ million astronomy/astrophysics papers)
|
| 95 |
+
- **Zenodo** (100K+ open science datasets and publications)
|
| 96 |
+
- **bioRxiv/medRxiv** (preprints in life sciences)
|
| 97 |
+
- **CORE** (~200+ million open access papers)
|
| 98 |
+
- **Google Scholar** (comprehensive academic search)
|
| 99 |
+
- **News API** (current events and trending research topics)
|
| 100 |
+
- **YouTube Data API** (academic videos, conference talks, lectures)
|
| 101 |
+
|
| 102 |
+
**Total Access:** **250+ million research papers and academic resources** across all major scientific disciplines.
|
| 103 |
+
|
| 104 |
+
### ποΈ Audio and Video Podcast Production
|
| 105 |
+
|
| 106 |
+
**Operating Audio Podcast System:**
|
| 107 |
+
Full production and distribution platform for subscriber-generated podcasts. Users can prompt, generate, publish, and distribute audio podcasts with RSS feed support for Spotify, Apple Podcasts, and Google Podcasts.
|
| 108 |
+
|
| 109 |
+
- Multi-voice AI podcast generation
|
| 110 |
+
- Research-driven content creation
|
| 111 |
+
- RSS feed distribution
|
| 112 |
+
- Public and private podcast options
|
| 113 |
+
- Professional audio quality
|
| 114 |
+
|
| 115 |
+
**Video Production (Future - Phase 2+):**
|
| 116 |
+
|
| 117 |
+
Advanced video features planned for future development:
|
| 118 |
+
|
| 119 |
+
**Planned Advanced Features (Phase 2-4):**
|
| 120 |
+
- **Visual Content Integration:**
|
| 121 |
+
- Automated extraction of figures and diagrams from research papers
|
| 122 |
+
- Screen capture and processing of academic illustrations
|
| 123 |
+
- Web scraping from scientific journal websites and preprint servers
|
| 124 |
+
- JSON database integration for structured visual data
|
| 125 |
+
|
| 126 |
+
- **Dynamic Visualization Generation:**
|
| 127 |
+
- On-the-fly scientific animations (molecular structures, data flows, algorithms)
|
| 128 |
+
- Real-time chart and graph generation from research data
|
| 129 |
+
- Python-based animations using matplotlib, plotly, mayavi
|
| 130 |
+
- Mathematical formula rendering (LaTeX β video)
|
| 131 |
+
|
| 132 |
+
- **External Video Quoting:**
|
| 133 |
+
- YouTube video segment extraction and integration
|
| 134 |
+
- Time-stamped video quoting with proper attribution
|
| 135 |
+
- Educational fair use compliance
|
| 136 |
+
- Source video discovery during research phase
|
| 137 |
+
|
| 138 |
+
- **Advanced Composition:**
|
| 139 |
+
- Multi-layer video composition (background, content, overlays, effects)
|
| 140 |
+
- Automatic subtitle generation from transcripts
|
| 141 |
+
- Text overlay system (key concepts, citations, speaker identification)
|
| 142 |
+
- Professional transitions and effects
|
| 143 |
+
- Audio-visual synchronization
|
| 144 |
+
|
| 145 |
+
**See:** [Science Video Database](https://huggingface.co/spaces/garywelz/sciencevideodb) - Companion project for research video content management.
|
| 146 |
+
|
| 147 |
+
### π Research Papers Metadata Database (Phase 2)
|
| 148 |
+
|
| 149 |
+
**Planned Implementation:**
|
| 150 |
+
A centralized **metadata repository** (not a file archive) that provides:
|
| 151 |
+
|
| 152 |
+
- **Structured JSON Objects:** Research paper metadata including:
|
| 153 |
+
- DOI, arXiv ID, publication information
|
| 154 |
+
- Abstracts and key findings
|
| 155 |
+
- Extracted entities (genes, proteins, chemical compounds, equations)
|
| 156 |
+
- Citation networks and cross-references
|
| 157 |
+
- Paradigm shift indicators
|
| 158 |
+
- Interdisciplinary connections
|
| 159 |
+
- Quality scores and relevance metrics
|
| 160 |
+
|
| 161 |
+
- **AI-Powered Preprocessing:**
|
| 162 |
+
- LLM-based entity extraction and annotation
|
| 163 |
+
- Automatic categorization by discipline and subdomain
|
| 164 |
+
- Keyword extraction and semantic tagging
|
| 165 |
+
- Citation tracking and relationship mapping
|
| 166 |
+
- Quality assessment and validation
|
| 167 |
+
|
| 168 |
+
- **Integration Features:**
|
| 169 |
+
- DOI/arXiv ID resolution and metadata enrichment
|
| 170 |
+
- Cross-reference linking between papers
|
| 171 |
+
- Podcast-to-paper relationship tracking
|
| 172 |
+
- Search and query capabilities
|
| 173 |
+
- API access for programmatic retrieval
|
| 174 |
+
|
| 175 |
+
**Technical Architecture:**
|
| 176 |
+
- Firestore NoSQL database for flexible JSON storage
|
| 177 |
+
- Google Cloud Functions for automated metadata processing
|
| 178 |
+
- Vertex AI for entity extraction and analysis
|
| 179 |
+
- RESTful API for external access
|
| 180 |
+
|
| 181 |
+
**Benefits:**
|
| 182 |
+
- Enables rapid research discovery across podcasts
|
| 183 |
+
- Supports knowledge graph construction
|
| 184 |
+
- Facilitates cross-disciplinary pattern recognition
|
| 185 |
+
- Provides foundation for semantic search capabilities
|
| 186 |
+
|
| 187 |
+
---
|
| 188 |
+
|
| 189 |
+
## ποΈ System Architecture
|
| 190 |
+
|
| 191 |
+
### Database Structure (Firestore)
|
| 192 |
+
|
| 193 |
+
**Collections:**
|
| 194 |
+
- **`subscribers`** - User accounts, preferences, subscription tiers, usage analytics
|
| 195 |
+
- **`podcast_jobs`** - Generated podcasts with full metadata, source papers, engagement metrics
|
| 196 |
+
- **`episodes`** - Published episodes with RSS distribution status
|
| 197 |
+
- **`research_papers`** (Phase 2) - Paper metadata database with AI-extracted entities
|
| 198 |
|
| 199 |
+
### Storage Structure (Google Cloud Storage)
|
|
|
|
|
|
|
|
|
|
|
|
|
| 200 |
|
| 201 |
+
- **`audio/`** - MP3 podcast files (multi-voice ElevenLabs synthesis)
|
| 202 |
+
- **`videos/`** - MP4 video podcasts (current and future)
|
| 203 |
+
- **`transcripts/`** - Full text transcripts with speaker markers
|
| 204 |
+
- **`descriptions/`** - Markdown descriptions with academic references
|
| 205 |
+
- **`thumbnails/`** - AI-generated episode artwork (DALL-E 3)
|
| 206 |
+
- **`video-assets/`** - Extracted figures, animations, visual content
|
| 207 |
+
- **`glmp-v2/`** - Genome Logic Modeling Project flowcharts (JSON)
|
| 208 |
|
| 209 |
+
### Backend Services (Google Cloud Run)
|
|
|
|
| 210 |
|
| 211 |
+
**Microservices Architecture:**
|
| 212 |
+
- **Podcast Generation Service** - Orchestrates research, content generation, and media production
|
| 213 |
+
- **Research Pipeline Service** - Multi-API academic search and analysis
|
| 214 |
+
- **Video Generation Service** - Video composition and encoding (Phase 1 complete)
|
| 215 |
+
- **RSS Service** - Feed generation and distribution
|
| 216 |
+
- **Episode Service** - Catalog management and metadata
|
| 217 |
|
| 218 |
+
---
|
| 219 |
|
| 220 |
+
## βοΈ Technology Stack
|
| 221 |
+
|
| 222 |
+
### AI & Machine Learning
|
| 223 |
+
- **Google Gemini 3** - Latest LLM for research analysis
|
| 224 |
+
- **Google Vertex AI** - Enterprise-scale model deployment and orchestration (used throughout platform)
|
| 225 |
+
- **OpenAI GPT-4/GPT-3.5** - Content synthesis and validation
|
| 226 |
+
- **Anthropic Claude 3** - Alternative reasoning via OpenRouter
|
| 227 |
+
- **ElevenLabs TTS** - Multi-voice text-to-speech synthesis
|
| 228 |
+
- **DALL-E 3** - AI-generated scientific visualizations
|
| 229 |
+
- **Google Cloud Vision API** - Image analysis and quality assessment
|
| 230 |
+
- **Video Intelligence API** - Scene detection and content analysis
|
| 231 |
+
|
| 232 |
+
### Backend Infrastructure
|
| 233 |
+
- **FastAPI** (Python) - RESTful API framework
|
| 234 |
+
- **Google Cloud Run** - Serverless container deployment
|
| 235 |
+
- **Firestore** - NoSQL document database
|
| 236 |
+
- **Cloud Storage** - Media file storage and CDN
|
| 237 |
+
- **Cloud Functions** - Event-driven processing
|
| 238 |
+
- **Cloud Tasks** - Background job queuing
|
| 239 |
+
- **Secret Manager** - API key and credential management
|
| 240 |
+
|
| 241 |
+
### Media Processing
|
| 242 |
+
- **FFmpeg** - Video encoding and composition
|
| 243 |
+
- **MoviePy** - Python video editing (planned)
|
| 244 |
+
- **Matplotlib/Plotly** - Scientific visualization (planned)
|
| 245 |
+
- **PyPDF2/pdfplumber** - PDF processing (planned)
|
| 246 |
+
|
| 247 |
+
### Frontend
|
| 248 |
+
- **Next.js 15.5.7** - React framework
|
| 249 |
+
- **Alpine.js** - Lightweight reactive UI
|
| 250 |
+
- **Tailwind CSS** - Utility-first styling
|
| 251 |
+
- **Vercel** - Frontend hosting and deployment
|
| 252 |
|
| 253 |
+
---
|
| 254 |
|
| 255 |
+
## π Platform Capabilities
|
| 256 |
|
| 257 |
+
### Research Coverage
|
| 258 |
+
- **250+ million research papers** accessible through integrated APIs
|
| 259 |
+
- **8+ academic databases** integrated with parallel search
|
| 260 |
+
- **Minimum 3 sources** required per episode for quality assurance
|
| 261 |
+
- **Multi-paper analysis** for comprehensive coverage
|
| 262 |
|
| 263 |
+
### Platform Features
|
| 264 |
+
- **Subscriber-driven content generation** - Users prompt and create podcasts
|
| 265 |
+
- **RSS feed distribution** to major podcast platforms
|
| 266 |
+
- **Public and private podcast options** - Share discoveries or keep them private
|
| 267 |
|
| 268 |
+
---
|
| 269 |
|
| 270 |
+
## π Live Platform & Resources
|
|
|
|
|
|
|
|
|
|
| 271 |
|
| 272 |
+
### Production Deployment
|
| 273 |
+
- π **[Homepage - Browse Podcasts](https://www.copernicusai.fyi)** - Public podcast catalog
|
| 274 |
+
- π **[Creator Dashboard](https://www.copernicusai.fyi/subscriber-dashboard.html)** - Subscriber interface
|
| 275 |
+
- π‘ **[RSS Feed](https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/feeds/copernicus-mvp-rss-feed.xml)** - Podcast distribution feed
|
| 276 |
|
| 277 |
+
## π§© CopernicusAI Knowledge Engine Components
|
|
|
|
| 278 |
|
| 279 |
+
The CopernicusAI Knowledge Engine is an integrated ecosystem of research and collaboration tools. The Knowledge Engine is **fully implemented and operational** (December 2025), with a working system deployed to Google Cloud Run. Currently, the platform includes five core components, with additional tools, databases, and collaboration features planned for future development:
|
|
|
|
| 280 |
|
| 281 |
+
### π― Knowledge Engine Implementation (December 2025)
|
|
|
|
| 282 |
|
| 283 |
+
**Fully Operational System:**
|
| 284 |
+
- **Live Dashboard:** https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine
|
| 285 |
+
- **Knowledge Graph:** Interactive visualization with 12,000+ indexed mathematics papers, relationship extraction (citations, semantic similarity, categories), and graph query capabilities
|
| 286 |
+
- **Vector Search:** Semantic search using Vertex AI embeddings across papers, podcasts, and processes
|
| 287 |
+
- **RAG System:** Retrieval-augmented generation with citation support, context retrieval, and multi-modal content integration
|
| 288 |
+
- **Unified Web Dashboard:** Production-ready interface with knowledge map visualization, search, RAG queries, content browsing, and statistics
|
| 289 |
+
- **Architecture:** FastAPI backend, Next.js frontend, Firestore database, Vertex AI for embeddings and LLM capabilities, Model Context Protocol (MCP) server for AI assistant integration
|
| 290 |
+
- **Deployment:** Fully deployed to Google Cloud Run, accessible 24/7
|
| 291 |
|
| 292 |
+
### Core Components
|
|
|
|
|
|
|
| 293 |
|
| 294 |
+
1. **π¬ CopernicusAI (This Platform)** - Core synthesis and distribution component
|
| 295 |
+
- AI-powered research synthesis and podcast generation
|
| 296 |
+
- Multi-API research integration (250+ million papers)
|
| 297 |
+
- Subscriber-driven content creation and sharing
|
| 298 |
+
- RSS feed distribution and platform management
|
| 299 |
|
| 300 |
+
2. **π οΈ Programming Framework** - Foundational meta-tool
|
| 301 |
+
- Universal method for process analysis across any discipline
|
| 302 |
+
- LLM-powered extraction and Mermaid visualization
|
| 303 |
+
- Domain-agnostic methodology for complex process analysis
|
| 304 |
+
- [Explore Framework β](https://huggingface.co/spaces/garywelz/programming_framework)
|
| 305 |
|
| 306 |
+
3. **𧬠GLMP - Genome Logic Modeling Project** - Specialized biological application
|
| 307 |
+
- First application of Programming Framework to biology
|
| 308 |
+
- 50+ biological processes visualized as interactive flowcharts
|
| 309 |
+
- JSON-based structured data in Google Cloud Storage
|
| 310 |
+
- [Explore GLMP β](https://huggingface.co/spaces/garywelz/glmp)
|
| 311 |
|
| 312 |
+
4. **π Research Paper Metadata Database** - Core data infrastructure
|
| 313 |
+
- Centralized metadata repository for scientific research papers
|
| 314 |
+
- AI-powered preprocessing and entity extraction
|
| 315 |
+
- Citation network analysis and relationship mapping
|
| 316 |
+
- Foundation for knowledge graph construction
|
| 317 |
+
- [Explore Metadata Database β](https://huggingface.co/spaces/garywelz/metadata_database)
|
| 318 |
|
| 319 |
+
5. **π¬ Science Video Database** - Multi-modal content component
|
| 320 |
+
- Curated searchable database of scientific video content
|
| 321 |
+
- Transcript-based search across multiple disciplines
|
| 322 |
+
- Integration with YouTube and other video sources
|
| 323 |
+
- [Explore Video Database β](https://huggingface.co/spaces/garywelz/sciencevideodb)
|
| 324 |
+
- [Live Demo β](https://scienceviddb-web-204731194849.us-central1.run.app/)
|
| 325 |
|
| 326 |
+
### Future Components
|
|
|
|
| 327 |
|
| 328 |
+
The Knowledge Engine is designed to grow and evolve. Additional tools, databases, and collaboration components will be added as the project develops, expanding capabilities for AI-assisted scientific research and knowledge discovery.
|
| 329 |
+
|
| 330 |
+
---
|
| 331 |
|
| 332 |
+
## π API Documentation
|
| 333 |
|
| 334 |
+
**Base URL:** `https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app`
|
|
|
|
|
|
|
|
|
|
|
|
|
| 335 |
|
| 336 |
+
### Podcast Generation Endpoints
|
| 337 |
+
- `POST /generate-podcast-with-subscriber` - Generate new podcast from research topic
|
| 338 |
+
- `GET /api/subscribers/podcasts/{id}` - Retrieve podcast details
|
| 339 |
+
- `POST /api/subscribers/podcasts/submit-to-rss` - Publish to RSS feed
|
| 340 |
+
|
| 341 |
+
### Research Endpoints
|
| 342 |
+
- `POST /api/papers/upload` - Upload paper metadata (Phase 2)
|
| 343 |
+
- `GET /api/papers/{paper_id}` - Retrieve paper metadata
|
| 344 |
+
- `POST /api/papers/query` - Query papers by discipline, keywords
|
| 345 |
+
- `POST /api/papers/{id}/link-podcast/{id}` - Link paper to podcast
|
| 346 |
+
|
| 347 |
+
### Admin Endpoints
|
| 348 |
+
- `GET /api/admin/subscribers` - List all subscribers and statistics
|
| 349 |
+
- `POST /api/admin/podcasts/fix-missing-titles` - Content maintenance
|
| 350 |
+
- `GET /api/admin/podcasts/catalog` - Full podcast catalog
|
| 351 |
+
|
| 352 |
+
---
|
| 353 |
|
| 354 |
+
## π Development Roadmap
|
| 355 |
+
|
| 356 |
+
### β
Phase 1: Core Platform (Complete)
|
| 357 |
+
- Multi-API research integration
|
| 358 |
+
- AI podcast generation with multi-voice synthesis
|
| 359 |
+
- RSS feed distribution
|
| 360 |
+
- Subscriber platform
|
| 361 |
+
- Basic video generation (static)
|
| 362 |
+
|
| 363 |
+
### π Phase 2: Content Enhancement (In Progress)
|
| 364 |
+
- **Research Papers Metadata Database** - JSON-based metadata repository
|
| 365 |
+
- **Visual Content Extraction** - Figures from papers, web scraping
|
| 366 |
+
- **YouTube Video Quoting** - External video integration with attribution
|
| 367 |
+
- **Advanced Video Features** - Multi-layer composition, animations
|
| 368 |
+
|
| 369 |
+
### π Phase 3: Advanced Visualizations (Planned)
|
| 370 |
+
- Scientific animation generation (matplotlib, plotly)
|
| 371 |
+
- Real-time data visualization
|
| 372 |
+
- Mathematical formula rendering
|
| 373 |
+
- Dynamic graph and network visualizations
|
| 374 |
+
|
| 375 |
+
### β
Phase 4: Knowledge Integration (Implemented - December 2025)
|
| 376 |
+
- **Knowledge Graph:** Fully operational with interactive visualization, 12,000+ papers indexed
|
| 377 |
+
- **Vector Search:** Semantic search implemented using Vertex AI embeddings
|
| 378 |
+
- **RAG System:** Retrieval-augmented generation with citations operational
|
| 379 |
+
- **Cross-Disciplinary Pattern Discovery:** Relationship extraction across papers, concepts, and categories
|
| 380 |
+
- **AI-Powered Content Recommendations:** Integrated into unified web dashboard
|
| 381 |
+
- **Live System:** https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine
|
| 382 |
|
| 383 |
+
---
|
|
|
|
|
|
|
|
|
|
|
|
|
| 384 |
|
| 385 |
+
## π¬ Collaborative Research Tools
|
| 386 |
|
| 387 |
+
**These platforms enable collective participation and collaboration across diverse user communities:**
|
|
|
|
| 388 |
|
| 389 |
+
- **Researchers** - Tools for hypothesis formation and testing, rapid synthesis of cross-disciplinary findings
|
| 390 |
+
- **Collaborators** - Collective knowledge exploration and refinement
|
| 391 |
+
- **Subscribers** - Prompt, generate, and share podcasts (public or private)
|
| 392 |
+
- **Community** - User suggestions, comments, and collaborative flowchart improvement (GLMP)
|
| 393 |
|
| 394 |
+
**Key Innovations:**
|
| 395 |
+
- **Multi-Source Validation** - Requires minimum 3 research sources per episode
|
| 396 |
+
- **Evidence-Based Generation** - No content generated without research backing
|
| 397 |
+
- **Paradigm Shift Detection** - Identifies revolutionary vs. incremental research
|
| 398 |
+
- **Interdisciplinary Connections** - Reveals cross-domain insights
|
| 399 |
+
- **Collaborative Participation** - User-driven content generation and sharing
|
| 400 |
+
- **Reproducibility** - Full citation tracking and source attribution
|
| 401 |
|
| 402 |
+
> *Like a microscope enables observation of the microscopic world, these tools enable observation and exploration of humanity's collective knowledge.*
|
| 403 |
|
| 404 |
+
---
|
| 405 |
|
| 406 |
+
## π Prior Work & Research Contributions
|
|
|
|
|
|
|
| 407 |
|
| 408 |
+
### Overview
|
| 409 |
+
This platform represents **prior work** that demonstrates foundational research and development achievements in AI-powered scientific knowledge synthesis, collaborative research tools, and multi-modal content generation. These contributions establish the technical foundation and proof-of-concept for the broader **CopernicusAI Knowledge Engine** initiative.
|
| 410 |
+
|
| 411 |
+
### Research Contributions
|
| 412 |
+
|
| 413 |
+
**1. AI-Powered Research Synthesis System**
|
| 414 |
+
- Developed and deployed a production-ready system for multi-source research synthesis using LLMs
|
| 415 |
+
- Demonstrated integration of 8+ academic databases (250+ million papers) with parallel search capabilities
|
| 416 |
+
- Implemented evidence-based content generation requiring minimum 3 research sources per output
|
| 417 |
+
- Achieved operational deployment with 64+ generated podcast episodes across 5 scientific disciplines
|
| 418 |
+
|
| 419 |
+
**2. Multi-Model LLM Architecture**
|
| 420 |
+
- Designed and implemented intelligent model selection framework using Google Gemini 3, OpenAI GPT-4, and Anthropic Claude 3
|
| 421 |
+
- Developed fallback chains for reliability and cost optimization
|
| 422 |
+
- Demonstrated paradigm shift detection and interdisciplinary connection identification in research domains
|
| 423 |
+
- Implemented entity extraction (genes, proteins, chemical compounds, mathematical concepts) from research literature
|
| 424 |
+
|
| 425 |
+
**3. Collaborative Research Platform Infrastructure**
|
| 426 |
+
- Built subscriber-driven content generation system enabling public/private research sharing
|
| 427 |
+
- Implemented RSS feed distribution compatible with major podcast platforms
|
| 428 |
+
- Developed microservices architecture on Google Cloud Run with Firestore and Cloud Storage
|
| 429 |
+
- Created RESTful API framework for programmatic access to research synthesis capabilities
|
| 430 |
+
|
| 431 |
+
**4. Integration with Knowledge Engine Components**
|
| 432 |
+
- Established integration pathways with GLMP (Genome Logic Modeling Project) for biological process visualization
|
| 433 |
+
- Designed architecture for Research Papers Metadata Database (Phase 2)
|
| 434 |
+
- Planned integration with Science Video Database for multi-modal content
|
| 435 |
+
- Created framework for Programming Framework integration across disciplines
|
| 436 |
+
|
| 437 |
+
### Technical Achievements
|
| 438 |
+
|
| 439 |
+
**Production Deployment:**
|
| 440 |
+
- Live platform: https://www.copernicusai.fyi
|
| 441 |
+
- Operational API: https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app
|
| 442 |
+
- RSS feed distribution: Active and functional
|
| 443 |
+
- Multi-voice audio synthesis: ElevenLabs TTS integration operational
|
| 444 |
+
|
| 445 |
+
**Research Infrastructure:**
|
| 446 |
+
- 250+ million research papers accessible via integrated APIs
|
| 447 |
+
- 8+ academic database integrations (PubMed, arXiv, NASA ADS, Zenodo, bioRxiv, CORE, Google Scholar, News API)
|
| 448 |
+
- **12,000+ mathematics papers indexed** with full metadata and vector embeddings in Knowledge Engine
|
| 449 |
+
- Automated citation extraction and formatting
|
| 450 |
+
- Quality scoring and relevance ranking systems
|
| 451 |
+
- **Knowledge Graph:** Fully operational with relationship extraction and interactive visualization
|
| 452 |
+
- **Vector Search:** Semantic search across papers, podcasts, and processes
|
| 453 |
+
- **RAG System:** Operational with citation support and multi-modal content integration
|
| 454 |
+
|
| 455 |
+
**Scalability & Architecture:**
|
| 456 |
+
- Serverless microservices architecture (Google Cloud Run)
|
| 457 |
+
- NoSQL database (Firestore) for flexible metadata storage
|
| 458 |
+
- Cloud Storage for media files and structured data
|
| 459 |
+
- Event-driven processing with Cloud Functions and Cloud Tasks
|
| 460 |
+
|
| 461 |
+
### Position Within CopernicusAI Knowledge Engine
|
| 462 |
+
|
| 463 |
+
This platform serves as the **core synthesis and distribution component** of the CopernicusAI Knowledge Engine. The Knowledge Engine is an integrated ecosystem of research and collaboration tools that work together to assist scientists in their workflow, from research discovery through knowledge synthesis to multi-format content generation.
|
| 464 |
+
|
| 465 |
+
**Current Components:**
|
| 466 |
+
1. **CopernicusAI** (This platform) - Core synthesis and distribution component for AI-powered research synthesis and podcast generation
|
| 467 |
+
2. **Knowledge Engine Dashboard** (β
Implemented December 2025) - Fully operational web interface with knowledge graph visualization, vector search, RAG queries, content browsing, and statistics. Live at: https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine
|
| 468 |
+
3. **Programming Framework** - Foundational meta-tool providing universal process analysis methodology
|
| 469 |
+
4. **GLMP (Genome Logic Modeling Project)** - Specialized biological application demonstrating domain-specific use of the Programming Framework
|
| 470 |
+
5. **Research Paper Metadata Database** - Core data infrastructure providing structured metadata and citation networks
|
| 471 |
+
6. **Science Video Database** - Multi-modal content component enabling video-based learning and research discovery
|
| 472 |
+
|
| 473 |
+
**Future Development:**
|
| 474 |
+
The Knowledge Engine is designed to grow and evolve. Additional tools, databases, and collaboration components will be added as the project develops, expanding capabilities for AI-assisted scientific research and knowledge discovery.
|
| 475 |
+
|
| 476 |
+
### Academic & Research Impact
|
| 477 |
+
|
| 478 |
+
**Publications & Presentations:**
|
| 479 |
+
- Platform architecture and methodology suitable for academic publication
|
| 480 |
+
- Open-source components available for research community use
|
| 481 |
+
- Publicly accessible research tools demonstrating AI-human collaboration in scientific knowledge synthesis
|
| 482 |
+
|
| 483 |
+
**Research Applications:**
|
| 484 |
+
- Supports hypothesis formation and testing through rapid multi-source synthesis
|
| 485 |
+
- Enables cross-disciplinary pattern recognition and connection identification
|
| 486 |
+
- Facilitates reproducible research communication with full citation tracking
|
| 487 |
+
- Provides infrastructure for collaborative knowledge exploration
|
| 488 |
+
|
| 489 |
+
**Educational Contributions:**
|
| 490 |
+
- 64+ research-driven podcast episodes across Biology, Chemistry, Computer Science, Mathematics, and Physics
|
| 491 |
+
- Evidence-based content requiring minimum 3 academic sources
|
| 492 |
+
- Public and private sharing options for research dissemination
|
| 493 |
+
- Integration with major podcast platforms for broad accessibility
|
| 494 |
+
|
| 495 |
+
### Citation Information
|
| 496 |
+
|
| 497 |
+
**For Grant Proposals:**
|
| 498 |
+
When citing this work as prior research, please reference:
|
| 499 |
+
|
| 500 |
+
- **Platform Name:** CopernicusAI - Knowledge Engine for Scientific Discovery
|
| 501 |
+
- **URL:** https://huggingface.co/spaces/garywelz/copernicusai
|
| 502 |
+
- **Live Platform:** https://www.copernicusai.fyi
|
| 503 |
+
- **Primary Developer:** Gary Welz
|
| 504 |
+
- **Year:** 2024-2025
|
| 505 |
+
- **License:** MIT
|
| 506 |
+
|
| 507 |
+
**Suggested Citation Format:**
|
| 508 |
+
```
|
| 509 |
+
Welz, G. (2025). CopernicusAI: Knowledge Engine for Scientific Discovery.
|
| 510 |
+
Hugging Face Space. https://huggingface.co/spaces/garywelz/copernicusai
|
| 511 |
+
```
|
| 512 |
+
|
| 513 |
+
## π Grant Support & Collaboration
|
| 514 |
+
|
| 515 |
+
**Grant Applications Supported:**
|
| 516 |
+
This platform is designed to support grant applications to:
|
| 517 |
+
- **NSF (National Science Foundation)** - Science education and research infrastructure
|
| 518 |
+
- **DOE (Department of Energy)** - Scientific computing and data science
|
| 519 |
+
- **SAIR Foundation** - AI research and development initiatives
|
| 520 |
+
|
| 521 |
+
**Research Contributions:**
|
| 522 |
+
- Open-source components and methodologies
|
| 523 |
+
- Publicly accessible research tools
|
| 524 |
+
- Educational content for broader scientific literacy
|
| 525 |
+
- Infrastructure for reproducible research communication
|
| 526 |
+
|
| 527 |
+
**Collaboration Opportunities:**
|
| 528 |
+
- Integration with academic institutions
|
| 529 |
+
- Partnership with research organizations
|
| 530 |
+
- Open data initiatives
|
| 531 |
+
- Educational program development
|
| 532 |
|
| 533 |
+
---
|
| 534 |
|
| 535 |
+
## How to Cite This Work
|
|
|
|
|
|
|
|
|
|
|
|
|
| 536 |
|
| 537 |
+
Welz, G. (2024β2025). *CopernicusAI: AI-Generated Audio Briefings as a Research Interface*.
|
| 538 |
+
Hugging Face Spaces. https://huggingface.co/spaces/garywelz/copernicusai
|
| 539 |
|
| 540 |
+
---
|
| 541 |
|
| 542 |
+
## π License & Attribution
|
|
|
|
|
|
|
|
|
|
|
|
|
| 543 |
|
| 544 |
+
**License:** MIT
|
| 545 |
|
| 546 |
+
**Attributions:**
|
| 547 |
+
- Built with Google Cloud Platform, Gemini AI, OpenAI, Anthropic Claude, and ElevenLabs
|
| 548 |
+
- Research data from PubMed, arXiv, NASA ADS, Zenodo, bioRxiv, CORE, and Google Scholar
|
| 549 |
+
- Academic paper metadata from respective publishers
|
| 550 |
|
| 551 |
---
|
| 552 |
|
| 553 |
+
## π§ Contact & Support
|
| 554 |
+
|
| 555 |
+
For questions, collaboration inquiries, or grant application support:
|
| 556 |
+
- **Hugging Face Space:** [https://huggingface.co/spaces/garywelz/copernicusai](https://huggingface.co/spaces/garywelz/copernicusai)
|
| 557 |
+
- **Platform:** [https://www.copernicusai.fyi](https://www.copernicusai.fyi)
|
| 558 |
+
|
| 559 |
+
---
|
| 560 |
|
| 561 |
+
**Β© 2025 CopernicusAI. All rights reserved.**
|
| 562 |
|
| 563 |
+
*Advancing scientific knowledge through AI-powered research communication and discovery.*
|
index.html
CHANGED
|
@@ -3,12 +3,11 @@
|
|
| 3 |
<head>
|
| 4 |
<meta charset="UTF-8">
|
| 5 |
<meta name="viewport" content="width=device-width, initial-scale=1.0">
|
| 6 |
-
<title>
|
| 7 |
<script src="https://cdn.tailwindcss.com"></script>
|
| 8 |
-
<script src="https://cdn.jsdelivr.net/npm/mermaid/dist/mermaid.min.js"></script>
|
| 9 |
<style>
|
| 10 |
.gradient-bg {
|
| 11 |
-
background: linear-gradient(135deg, #
|
| 12 |
}
|
| 13 |
.card-hover {
|
| 14 |
transition: transform 0.3s ease, box-shadow 0.3s ease;
|
|
@@ -17,6 +16,13 @@
|
|
| 17 |
transform: translateY(-4px);
|
| 18 |
box-shadow: 0 20px 40px rgba(0,0,0,0.15);
|
| 19 |
}
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 20 |
</style>
|
| 21 |
</head>
|
| 22 |
<body class="bg-gray-50">
|
|
@@ -24,12 +30,14 @@
|
|
| 24 |
<header class="gradient-bg text-white">
|
| 25 |
<div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-16">
|
| 26 |
<div class="text-center">
|
| 27 |
-
<div class="text-6xl mb-4"
|
| 28 |
-
<h1 class="text-5xl font-bold mb-4">
|
| 29 |
-
<p class="text-xl opacity-90 mb-6">
|
| 30 |
-
<p class="text-lg opacity-75 max-w-
|
| 31 |
-
|
| 32 |
-
|
|
|
|
|
|
|
| 33 |
</p>
|
| 34 |
</div>
|
| 35 |
</div>
|
|
@@ -37,448 +45,685 @@
|
|
| 37 |
|
| 38 |
<!-- Abstract/Summary -->
|
| 39 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
|
| 40 |
-
<div class="bg-white rounded-xl shadow-lg p-8 mb-8 border-l-4 border-
|
| 41 |
<h2 class="text-2xl font-bold text-gray-900 mb-4">π Summary</h2>
|
| 42 |
<p class="text-lg text-gray-700 leading-relaxed mb-3">
|
| 43 |
-
|
| 44 |
</p>
|
| 45 |
<p class="text-gray-600">
|
| 46 |
-
|
| 47 |
</p>
|
| 48 |
</div>
|
| 49 |
</section>
|
| 50 |
|
| 51 |
-
<!-- Prior Work
|
| 52 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
|
| 53 |
-
<div class="bg-gradient-to-r from-
|
| 54 |
-
<h2 class="text-3xl font-bold text-gray-900 mb-
|
| 55 |
|
| 56 |
<div class="bg-white rounded-lg p-6 mb-6">
|
| 57 |
-
<h3 class="text-xl font-semibold text-gray-900 mb-
|
|
|
|
|
|
|
|
|
|
| 58 |
<p class="text-gray-700 mb-4">
|
| 59 |
-
The
|
| 60 |
</p>
|
| 61 |
-
|
| 62 |
-
|
| 63 |
-
|
| 64 |
-
<div class="bg-
|
| 65 |
-
<h3 class="
|
| 66 |
-
<ul class="text-
|
| 67 |
-
<li>β’
|
| 68 |
-
<li>β’
|
| 69 |
-
<li>β’
|
| 70 |
-
<li>β’ <strong>Iterative Refinement:</strong> Systematic approach enabling continuous improvement</li>
|
| 71 |
-
<li>β’ <strong>Scale Demonstration:</strong> Applied to 313+ processes across 5 disciplines (Biology: 52, Chemistry: 91, Physics: 21, Computer Science: 21, Mathematics: 20, GLMP: 109)</li>
|
| 72 |
-
<li>β’ <strong>Validation:</strong> Successfully processes complex biological, chemical, and computational workflows with high accuracy</li>
|
| 73 |
-
</ul>
|
| 74 |
-
</div>
|
| 75 |
-
|
| 76 |
-
<div class="bg-white rounded-lg p-6">
|
| 77 |
-
<h3 class="text-lg font-semibold text-gray-900 mb-3">βοΈ Technical Achievements</h3>
|
| 78 |
-
<ul class="text-sm text-gray-700 space-y-2">
|
| 79 |
-
<li>β’ <strong>Meta-Tool Architecture:</strong> Framework for creating specialized analysis tools</li>
|
| 80 |
-
<li>β’ <strong>JSON-Based Storage:</strong> Structured format enabling version control and API integration</li>
|
| 81 |
-
<li>β’ <strong>Multi-Domain Application:</strong> Successfully applied to biological processes (GLMP)</li>
|
| 82 |
-
<li>β’ <strong>Integration Framework:</strong> Designed for knowledge engines and collaborative platforms</li>
|
| 83 |
</ul>
|
| 84 |
</div>
|
| 85 |
</div>
|
| 86 |
-
|
| 87 |
-
<div class="bg-
|
| 88 |
-
<h3 class="text-
|
| 89 |
<p class="text-gray-700 mb-3">
|
| 90 |
-
The
|
| 91 |
</p>
|
| 92 |
-
<
|
| 93 |
-
|
| 94 |
-
<li>β’ GLMP (Genome Logic Modeling Project)</li>
|
| 95 |
-
<li>β’ CopernicusAI (main knowledge engine)</li>
|
| 96 |
-
<li>β’ Research Papers Metadata Database</li>
|
| 97 |
-
</ul>
|
| 98 |
-
<ul class="text-gray-700 space-y-1">
|
| 99 |
-
<li>β’ Science Video Database</li>
|
| 100 |
-
<li>β’ Multi-domain process analysis</li>
|
| 101 |
-
</ul>
|
| 102 |
-
</div>
|
| 103 |
-
<p class="text-gray-600 text-sm italic">
|
| 104 |
-
This work establishes a proof-of-concept for AI-assisted process analysis, demonstrating how LLMs can systematically extract and visualize complex logic from textual sources across diverse domains.
|
| 105 |
</p>
|
| 106 |
</div>
|
| 107 |
</div>
|
| 108 |
</section>
|
| 109 |
|
| 110 |
-
<!--
|
| 111 |
-
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 -
|
| 112 |
-
<div class="
|
| 113 |
-
<
|
| 114 |
-
|
| 115 |
-
|
| 116 |
-
|
| 117 |
-
|
| 118 |
-
|
| 119 |
-
|
| 120 |
-
|
| 121 |
-
|
| 122 |
-
|
| 123 |
-
|
| 124 |
-
|
| 125 |
-
|
| 126 |
-
|
| 127 |
-
<div class="text-sm text-gray-600">Structured Data</div>
|
| 128 |
-
</div>
|
| 129 |
</div>
|
| 130 |
</section>
|
| 131 |
|
| 132 |
-
<!--
|
| 133 |
-
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-
|
| 134 |
-
<div class="bg-
|
| 135 |
-
<h2 class="text-3xl font-bold text-gray-900 mb-6"
|
| 136 |
-
<
|
| 137 |
-
|
| 138 |
-
|
| 139 |
-
|
| 140 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 141 |
</p>
|
| 142 |
-
|
| 143 |
-
|
| 144 |
-
|
| 145 |
-
<
|
| 146 |
-
|
| 147 |
-
|
| 148 |
-
|
| 149 |
-
|
| 150 |
-
|
|
|
|
| 151 |
</div>
|
| 152 |
-
|
| 153 |
-
|
| 154 |
-
<
|
| 155 |
-
|
| 156 |
-
|
| 157 |
-
|
| 158 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 159 |
</p>
|
| 160 |
</div>
|
| 161 |
</div>
|
| 162 |
</div>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 163 |
</div>
|
| 164 |
</section>
|
| 165 |
|
| 166 |
-
<!--
|
| 167 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 168 |
-
<div class="
|
| 169 |
-
<
|
| 170 |
-
|
| 171 |
-
|
| 172 |
-
<div class="
|
| 173 |
-
|
| 174 |
-
|
| 175 |
-
|
| 176 |
-
</div>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 177 |
|
| 178 |
-
|
| 179 |
-
|
| 180 |
-
|
| 181 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 182 |
</div>
|
|
|
|
| 183 |
|
| 184 |
-
|
| 185 |
-
|
| 186 |
-
|
| 187 |
-
<
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 188 |
</div>
|
|
|
|
| 189 |
|
| 190 |
-
|
| 191 |
-
|
| 192 |
-
|
| 193 |
-
<
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 194 |
</div>
|
| 195 |
</div>
|
| 196 |
|
| 197 |
-
|
| 198 |
-
|
| 199 |
-
<div class="
|
| 200 |
-
<
|
| 201 |
-
<
|
| 202 |
-
"
|
| 203 |
-
|
| 204 |
-
|
| 205 |
-
|
| 206 |
-
|
| 207 |
-
|
| 208 |
-
|
| 209 |
-
|
| 210 |
-
|
| 211 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 212 |
</div>
|
| 213 |
</div>
|
| 214 |
|
| 215 |
-
|
| 216 |
-
|
| 217 |
-
<div
|
| 218 |
-
|
| 219 |
-
|
| 220 |
-
|
| 221 |
-
|
| 222 |
-
|
| 223 |
-
|
| 224 |
-
|
| 225 |
-
|
| 226 |
-
|
| 227 |
-
|
| 228 |
-
|
| 229 |
-
|
| 230 |
-
|
| 231 |
-
|
| 232 |
-
|
| 233 |
-
|
| 234 |
-
|
| 235 |
-
|
| 236 |
-
|
| 237 |
-
|
| 238 |
-
|
| 239 |
-
|
| 240 |
-
|
| 241 |
-
|
| 242 |
-
|
| 243 |
-
|
| 244 |
-
|
| 245 |
-
|
| 246 |
-
|
| 247 |
-
|
| 248 |
-
|
| 249 |
-
<span><span class="inline-block w-4 h-4 rounded" style="background-color: #ff6b6b;"></span> Red - Triggers & Inputs</span>
|
| 250 |
-
<span><span class="inline-block w-4 h-4 rounded" style="background-color: #ffd43b;"></span> Yellow - Structures & Objects</span>
|
| 251 |
-
<span><span class="inline-block w-4 h-4 rounded" style="background-color: #51cf66;"></span> Green - Processing & Operations</span>
|
| 252 |
-
<span><span class="inline-block w-4 h-4 rounded" style="background-color: #74c0fc;"></span> Blue - Intermediates & States</span>
|
| 253 |
-
<span><span class="inline-block w-4 h-4 rounded" style="background-color: #b197fc;"></span> Violet - Products & Outputs</span>
|
| 254 |
</div>
|
| 255 |
</div>
|
| 256 |
</div>
|
| 257 |
</div>
|
| 258 |
</section>
|
| 259 |
|
| 260 |
-
<!--
|
| 261 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 262 |
-
<
|
| 263 |
-
|
| 264 |
-
|
| 265 |
-
<div class="
|
| 266 |
-
<div class="
|
| 267 |
-
|
| 268 |
-
|
| 269 |
-
|
| 270 |
-
|
| 271 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 272 |
</div>
|
| 273 |
-
|
| 274 |
-
<div class="bg-
|
| 275 |
-
<
|
| 276 |
-
<
|
| 277 |
-
|
| 278 |
-
Start with rough analysis, visualize, identify gaps, refine with LLM, repeat until
|
| 279 |
-
the process logic is crystal clear.
|
| 280 |
</p>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 281 |
</div>
|
| 282 |
-
|
| 283 |
-
<div class="bg-
|
| 284 |
-
<
|
| 285 |
-
<
|
| 286 |
-
|
| 287 |
-
JSON storage enables programmatic access, version control, cross-referencing,
|
| 288 |
-
and integration with other tools and databases.
|
| 289 |
</p>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 290 |
</div>
|
| 291 |
</div>
|
| 292 |
</section>
|
| 293 |
|
| 294 |
-
|
| 295 |
-
<!-- Process Diagram Collections -->
|
| 296 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 297 |
-
<div class="bg-
|
| 298 |
-
<h2 class="text-3xl font-bold text-gray-900 mb-6"
|
| 299 |
-
<p class="text-gray-700 mb-6">
|
| 300 |
-
The Programming Framework has been applied across multiple scientific disciplines. Explore interactive flowchart collections organized by domain:
|
| 301 |
-
</p>
|
| 302 |
|
| 303 |
-
<div class="grid md:grid-cols-
|
| 304 |
-
|
| 305 |
-
|
| 306 |
-
<
|
| 307 |
-
<
|
| 308 |
-
|
| 309 |
-
|
| 310 |
-
|
| 311 |
-
|
| 312 |
-
|
| 313 |
-
<
|
| 314 |
-
|
| 315 |
-
|
| 316 |
-
ποΈ Biology Processes Database β
|
| 317 |
-
</a>
|
| 318 |
-
<a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/glmp-database-table.html"
|
| 319 |
-
class="text-orange-600 hover:text-orange-700 font-medium text-sm inline-block"
|
| 320 |
-
target="_blank" rel="noopener noreferrer">
|
| 321 |
-
ποΈ GLMP Database Table β
|
| 322 |
-
</a>
|
| 323 |
-
</div>
|
| 324 |
-
<p class="text-xs text-gray-500 mt-2">
|
| 325 |
-
Biology Database: 52 processes (organismal/ecological) | GLMP: 50+ processes (biochemical/molecular)
|
| 326 |
-
</p>
|
| 327 |
-
</div>
|
| 328 |
-
|
| 329 |
-
<!-- Chemistry -->
|
| 330 |
-
<div class="bg-white rounded-lg shadow-md p-6 card-hover">
|
| 331 |
-
<h3 class="text-xl font-semibold text-gray-900 mb-4 flex items-center">
|
| 332 |
-
<span class="text-2xl mr-2">βοΈ</span> Chemistry
|
| 333 |
-
</h3>
|
| 334 |
-
<p class="text-gray-600 text-sm mb-3">
|
| 335 |
-
Comprehensive chemistry process diagrams across all major branches.
|
| 336 |
-
</p>
|
| 337 |
-
<a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/chemistry-processes-database/chemistry-database-table.html"
|
| 338 |
-
class="text-orange-600 hover:text-orange-700 font-medium text-sm inline-block"
|
| 339 |
-
target="_blank" rel="noopener noreferrer">
|
| 340 |
-
ποΈ Chemistry Database Table β
|
| 341 |
-
</a>
|
| 342 |
-
<p class="text-xs text-gray-500 mt-2">
|
| 343 |
-
56 processes across 14 subcategories
|
| 344 |
-
</p>
|
| 345 |
-
</div>
|
| 346 |
-
|
| 347 |
-
<!-- Mathematics -->
|
| 348 |
-
<div class="bg-white rounded-lg shadow-md p-6 card-hover">
|
| 349 |
-
<h3 class="text-xl font-semibold text-gray-900 mb-4 flex items-center">
|
| 350 |
-
<span class="text-2xl mr-2">π’</span> Mathematics
|
| 351 |
-
</h3>
|
| 352 |
-
<p class="text-gray-600 text-sm mb-3">
|
| 353 |
-
Mathematical algorithms, proof methods, and computational processes.
|
| 354 |
-
</p>
|
| 355 |
-
<a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/mathematics-processes-database/mathematics-database-table.html"
|
| 356 |
-
class="text-orange-600 hover:text-orange-700 font-medium text-sm inline-block"
|
| 357 |
-
target="_blank" rel="noopener noreferrer">
|
| 358 |
-
ποΈ Mathematics Database Table β
|
| 359 |
-
</a>
|
| 360 |
-
<p class="text-xs text-gray-500 mt-2">
|
| 361 |
-
20 processes across 7 subcategories
|
| 362 |
-
</p>
|
| 363 |
</div>
|
| 364 |
-
|
| 365 |
-
|
| 366 |
-
|
| 367 |
-
<
|
| 368 |
-
<
|
| 369 |
-
|
| 370 |
-
|
| 371 |
-
|
| 372 |
-
|
| 373 |
-
|
| 374 |
-
|
| 375 |
-
|
| 376 |
-
ποΈ Physics Database Table β
|
| 377 |
-
</a>
|
| 378 |
-
<p class="text-xs text-gray-500 mt-2">
|
| 379 |
-
21 processes across 7 subcategories
|
| 380 |
-
</p>
|
| 381 |
</div>
|
| 382 |
-
|
| 383 |
-
|
| 384 |
-
|
| 385 |
-
<
|
| 386 |
-
<
|
| 387 |
-
|
| 388 |
-
|
| 389 |
-
|
| 390 |
-
</
|
| 391 |
-
<a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/computer-science-processes-database/computer-science-database-table.html"
|
| 392 |
-
class="text-orange-600 hover:text-orange-700 font-medium text-sm inline-block"
|
| 393 |
-
target="_blank" rel="noopener noreferrer">
|
| 394 |
-
ποΈ Computer Science Database Table β
|
| 395 |
-
</a>
|
| 396 |
-
<p class="text-xs text-gray-500 mt-2">
|
| 397 |
-
21 processes across 7 subcategories
|
| 398 |
-
</p>
|
| 399 |
</div>
|
| 400 |
</div>
|
| 401 |
</div>
|
| 402 |
</section>
|
| 403 |
|
| 404 |
-
<!--
|
| 405 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 406 |
-
<div class="bg-
|
| 407 |
-
<h2 class="text-3xl font-bold mb-6"
|
| 408 |
|
| 409 |
-
<div class="grid md:grid-cols-2 gap-
|
| 410 |
-
<div>
|
| 411 |
-
<h3 class="text-xl font-semibold text-
|
| 412 |
-
<ul class="
|
| 413 |
-
<li>β’
|
| 414 |
-
<li>β’
|
| 415 |
-
<li>β’
|
| 416 |
-
<li>β’
|
|
|
|
| 417 |
</ul>
|
| 418 |
</div>
|
| 419 |
-
|
| 420 |
-
<div>
|
| 421 |
-
<h3 class="text-xl font-semibold text-
|
| 422 |
-
<ul class="
|
| 423 |
-
<li>β’
|
| 424 |
-
<li>β’
|
| 425 |
-
<li>β’
|
| 426 |
-
<li>β’
|
|
|
|
|
|
|
| 427 |
</ul>
|
| 428 |
</div>
|
|
|
|
|
|
|
|
|
|
| 429 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 430 |
<div>
|
| 431 |
-
<h3 class="text-xl font-semibold text-
|
| 432 |
-
<
|
| 433 |
-
|
| 434 |
-
|
| 435 |
-
|
| 436 |
-
<li>β’
|
|
|
|
|
|
|
|
|
|
| 437 |
</ul>
|
|
|
|
|
|
|
|
|
|
|
|
|
| 438 |
</div>
|
| 439 |
-
|
| 440 |
<div>
|
| 441 |
-
<h3 class="text-xl font-semibold text-
|
| 442 |
-
<ul class="space-y-2
|
| 443 |
-
<li>β’
|
| 444 |
-
<li>β’
|
| 445 |
-
<li>β’
|
| 446 |
-
<li>β’
|
|
|
|
|
|
|
| 447 |
</ul>
|
| 448 |
</div>
|
| 449 |
</div>
|
| 450 |
</div>
|
| 451 |
</section>
|
| 452 |
|
| 453 |
-
<!--
|
| 454 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 455 |
-
<
|
| 456 |
-
|
| 457 |
-
|
| 458 |
-
<div class="bg-white rounded-lg
|
| 459 |
-
<h3 class="text-xl font-semibold text-gray-900 mb-
|
| 460 |
-
<p class="text-gray-
|
| 461 |
-
|
| 462 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 463 |
</p>
|
| 464 |
-
<
|
| 465 |
-
|
| 466 |
-
|
| 467 |
-
|
| 468 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 469 |
</div>
|
| 470 |
|
| 471 |
-
<div class="bg-
|
| 472 |
-
<h3 class="text-
|
| 473 |
-
<p class="text-gray-
|
| 474 |
-
|
| 475 |
-
and knowledge graph for scientific discovery.
|
| 476 |
</p>
|
| 477 |
-
<
|
| 478 |
-
|
| 479 |
-
|
| 480 |
-
|
| 481 |
-
</
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 482 |
</div>
|
| 483 |
</div>
|
| 484 |
</section>
|
|
@@ -487,37 +732,239 @@
|
|
| 487 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 488 |
<div class="bg-white rounded-xl shadow-lg p-8">
|
| 489 |
<h2 class="text-3xl font-bold text-gray-900 mb-6">How to Cite This Work</h2>
|
| 490 |
-
<div class="bg-gray-50 rounded-lg p-6 mb-4
|
| 491 |
-
<
|
| 492 |
-
<
|
| 493 |
-
|
| 494 |
-
|
| 495 |
-
|
| 496 |
-
|
| 497 |
-
|
| 498 |
-
|
| 499 |
-
title={
|
| 500 |
author={Welz, Gary},
|
| 501 |
year={2024--2025},
|
| 502 |
-
url={https://huggingface.co/spaces/garywelz/
|
| 503 |
-
note={Hugging Face
|
| 504 |
}</code></pre>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 505 |
</div>
|
| 506 |
</div>
|
| 507 |
-
|
| 508 |
-
|
| 509 |
-
|
| 510 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 511 |
</p>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 512 |
</div>
|
| 513 |
</div>
|
| 514 |
-
|
| 515 |
-
|
| 516 |
-
|
| 517 |
-
|
| 518 |
-
|
| 519 |
-
|
| 520 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 521 |
</div>
|
| 522 |
</div>
|
| 523 |
</section>
|
|
@@ -525,15 +972,10 @@
|
|
| 525 |
<!-- Footer -->
|
| 526 |
<footer class="gradient-bg text-white py-8 mt-12">
|
| 527 |
<div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 text-center">
|
| 528 |
-
<p class="text-lg font-semibold mb-2">
|
| 529 |
-
<p class="text-sm opacity-75">
|
| 530 |
-
<p class="text-xs opacity-50 mt-4">© 2025
|
| 531 |
</div>
|
| 532 |
</footer>
|
| 533 |
-
|
| 534 |
-
<script>
|
| 535 |
-
mermaid.initialize({ startOnLoad: true, theme: 'default' });
|
| 536 |
-
</script>
|
| 537 |
</body>
|
| 538 |
</html>
|
| 539 |
-
|
|
|
|
| 3 |
<head>
|
| 4 |
<meta charset="UTF-8">
|
| 5 |
<meta name="viewport" content="width=device-width, initial-scale=1.0">
|
| 6 |
+
<title>CopernicusAI - Research-Driven Podcast Generation Platform</title>
|
| 7 |
<script src="https://cdn.tailwindcss.com"></script>
|
|
|
|
| 8 |
<style>
|
| 9 |
.gradient-bg {
|
| 10 |
+
background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
|
| 11 |
}
|
| 12 |
.card-hover {
|
| 13 |
transition: transform 0.3s ease, box-shadow 0.3s ease;
|
|
|
|
| 16 |
transform: translateY(-4px);
|
| 17 |
box-shadow: 0 20px 40px rgba(0,0,0,0.15);
|
| 18 |
}
|
| 19 |
+
.stat-number {
|
| 20 |
+
font-size: 2.5rem;
|
| 21 |
+
font-weight: bold;
|
| 22 |
+
background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
|
| 23 |
+
-webkit-background-clip: text;
|
| 24 |
+
-webkit-text-fill-color: transparent;
|
| 25 |
+
}
|
| 26 |
</style>
|
| 27 |
</head>
|
| 28 |
<body class="bg-gray-50">
|
|
|
|
| 30 |
<header class="gradient-bg text-white">
|
| 31 |
<div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-16">
|
| 32 |
<div class="text-center">
|
| 33 |
+
<div class="text-6xl mb-4">π¬</div>
|
| 34 |
+
<h1 class="text-5xl font-bold mb-4">CopernicusAI</h1>
|
| 35 |
+
<p class="text-xl opacity-90 mb-6">Knowledge Engine for Scientific Discovery</p>
|
| 36 |
+
<p class="text-lg opacity-75 max-w-4xl mx-auto">
|
| 37 |
+
A collaborative research platform that transforms cutting-edge scientific research into accessible,
|
| 38 |
+
multi-format tools for collective knowledge exploration. These are research instrumentsβlike microscopes
|
| 39 |
+
for observing the collective knowledge of humanityβenabling hypothesis formation, testing, and discovery
|
| 40 |
+
across scientific disciplines.
|
| 41 |
</p>
|
| 42 |
</div>
|
| 43 |
</div>
|
|
|
|
| 45 |
|
| 46 |
<!-- Abstract/Summary -->
|
| 47 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
|
| 48 |
+
<div class="bg-white rounded-xl shadow-lg p-8 mb-8 border-l-4 border-purple-600">
|
| 49 |
<h2 class="text-2xl font-bold text-gray-900 mb-4">π Summary</h2>
|
| 50 |
<p class="text-lg text-gray-700 leading-relaxed mb-3">
|
| 51 |
+
<strong>CopernicusAI</strong> is an operational research platform that synthesizes scientific literature from 250+ million papers into AI-generated podcasts, integrates with a knowledge graph of 23,246 indexed papers, and provides collaborative tools for research discovery. The system demonstrates production-ready multi-source research synthesis with full citation tracking and evidence-based content generation requiring minimum 3 research sources per episode.
|
| 52 |
</p>
|
| 53 |
<p class="text-gray-600">
|
| 54 |
+
The platform includes a fully operational Knowledge Engine Dashboard (deployed December 2025) with interactive knowledge graph visualization, vector search, and RAG capabilities, enabling researchers to explore, query, and synthesize scientific knowledge across disciplines.
|
| 55 |
</p>
|
| 56 |
</div>
|
| 57 |
</section>
|
| 58 |
|
| 59 |
+
<!-- Prior Work: CopernicusAI Research Interface -->
|
| 60 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
|
| 61 |
+
<div class="bg-gradient-to-r from-purple-50 to-blue-50 rounded-xl shadow-lg p-8 mb-8">
|
| 62 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-4">Prior Work & Current Status</h2>
|
| 63 |
|
| 64 |
<div class="bg-white rounded-lg p-6 mb-6">
|
| 65 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-3">Prior Work (2024-2025)</h3>
|
| 66 |
+
<p class="text-lg text-gray-700 leading-relaxed mb-4">
|
| 67 |
+
CopernicusAI is an active research prototype exploring AI-generated audio briefings as an interface for assisted scientific research.
|
| 68 |
+
</p>
|
| 69 |
<p class="text-gray-700 mb-4">
|
| 70 |
+
The system allows any user to generate, refine, and share AI-generated science podcasts based on structured prompts, enabling rapid orientation to a topic, iterative deepening, and personalized research briefings.
|
| 71 |
</p>
|
| 72 |
+
<p class="text-gray-700 mb-4">
|
| 73 |
+
Rather than functioning as a static content platform, CopernicusAI supports collectively generated and shared research artifacts, analogous to community-driven knowledge platforms (e.g., discussion forums), but grounded in scientific sources and metadata-aware workflows.
|
| 74 |
+
</p>
|
| 75 |
+
<div class="bg-blue-50 rounded-lg p-4 mt-4">
|
| 76 |
+
<h3 class="font-semibold text-gray-900 mb-2">This work demonstrates technical feasibility for:</h3>
|
| 77 |
+
<ul class="text-gray-700 space-y-1">
|
| 78 |
+
<li>β’ AI-assisted research briefing and orientation</li>
|
| 79 |
+
<li>β’ Iterative question refinement via conversational interfaces</li>
|
| 80 |
+
<li>β’ Integration of text, audio, and metadata in research workflows</li>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 81 |
</ul>
|
| 82 |
</div>
|
| 83 |
</div>
|
| 84 |
+
|
| 85 |
+
<div class="bg-green-50 border-2 border-green-200 rounded-lg p-6">
|
| 86 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-3">Current Implementation (December 2025)</h3>
|
| 87 |
<p class="text-gray-700 mb-3">
|
| 88 |
+
The Knowledge Engine Dashboard is <strong>fully operational</strong> and deployed to Google Cloud Run, providing unified access to all components with interactive knowledge graph visualization, vector search, RAG queries, and content browsing.
|
| 89 |
</p>
|
| 90 |
+
<p class="text-sm text-gray-600">
|
| 91 |
+
See the "Knowledge Engine Ecosystem" section below for details.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 92 |
</p>
|
| 93 |
</div>
|
| 94 |
</div>
|
| 95 |
</section>
|
| 96 |
|
| 97 |
+
<!-- Mission & Vision -->
|
| 98 |
+
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
|
| 99 |
+
<div class="bg-white rounded-xl shadow-lg p-8 mb-8">
|
| 100 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-4">π― Mission & Vision</h2>
|
| 101 |
+
<p class="text-lg text-gray-700 leading-relaxed mb-4">
|
| 102 |
+
Inspired by Nicolaus Copernicus who challenged accepted knowledge with evidence and rigorous analysis,
|
| 103 |
+
<strong>CopernicusAI</strong> creates collaborative research tools that enable collective participation in
|
| 104 |
+
scientific discovery. These platforms are instruments for exploring humanity's collective knowledgeβtools for
|
| 105 |
+
hypothesis formation, testing, and collaborative research, not just educational content.
|
| 106 |
+
</p>
|
| 107 |
+
<p class="text-gray-600">
|
| 108 |
+
Just as a microscope enables observation of the microscopic world, CopernicusAI tools enable observation and
|
| 109 |
+
exploration of humanity's collective knowledge. Subscribers collaborate to prompt, generate, and refine research
|
| 110 |
+
contentβsharing discoveries publicly or keeping them private. As large language models (LLMs) and AI systems
|
| 111 |
+
gain unprecedented knowledge, CopernicusAI provides the infrastructure for human-AI collaborative knowledge
|
| 112 |
+
exploration, with evidence-based truth-seeking as our guiding principle.
|
| 113 |
+
</p>
|
|
|
|
|
|
|
| 114 |
</div>
|
| 115 |
</section>
|
| 116 |
|
| 117 |
+
<!-- Knowledge Engine Ecosystem -->
|
| 118 |
+
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 119 |
+
<div class="bg-gradient-to-r from-purple-50 to-indigo-50 rounded-xl shadow-lg p-8 mb-8">
|
| 120 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-6 text-center">π§© CopernicusAI Knowledge Engine</h2>
|
| 121 |
+
<p class="text-lg text-gray-700 mb-6 text-center max-w-4xl mx-auto">
|
| 122 |
+
An integrated ecosystem of research and collaboration tools designed to assist scientists in their workflow,
|
| 123 |
+
from research discovery through knowledge synthesis to multi-format content generation.
|
| 124 |
+
</p>
|
| 125 |
+
|
| 126 |
+
<!-- Knowledge Engine Implementation Status -->
|
| 127 |
+
<div class="bg-green-50 border-2 border-green-200 rounded-xl p-6 mb-6">
|
| 128 |
+
<div class="flex items-center mb-3">
|
| 129 |
+
<span class="text-3xl mr-3">β
</span>
|
| 130 |
+
<h3 class="text-xl font-bold text-gray-900">Knowledge Engine Dashboard - Fully Operational (December 2025)</h3>
|
| 131 |
+
</div>
|
| 132 |
+
<p class="text-gray-700 mb-4">
|
| 133 |
+
The Knowledge Engine is now <strong>fully implemented and deployed</strong> with a working web dashboard providing unified access to all components.
|
| 134 |
</p>
|
| 135 |
+
<div class="grid md:grid-cols-2 gap-4 mb-4">
|
| 136 |
+
<div>
|
| 137 |
+
<h4 class="font-semibold text-gray-800 mb-2">Key Features:</h4>
|
| 138 |
+
<ul class="text-sm text-gray-700 space-y-1">
|
| 139 |
+
<li>β Interactive Knowledge Graph (23,246 papers)</li>
|
| 140 |
+
<li>β Vector Search (semantic similarity)</li>
|
| 141 |
+
<li>β RAG System (with citations)</li>
|
| 142 |
+
<li>β Content Browsing (papers, podcasts, processes)</li>
|
| 143 |
+
<li>β Statistics Dashboard</li>
|
| 144 |
+
</ul>
|
| 145 |
</div>
|
| 146 |
+
<div>
|
| 147 |
+
<h4 class="font-semibold text-gray-800 mb-2">Live System:</h4>
|
| 148 |
+
<p class="text-sm text-gray-700 mb-2">
|
| 149 |
+
<a href="https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine"
|
| 150 |
+
target="_blank" rel="noopener noreferrer"
|
| 151 |
+
class="text-blue-600 hover:underline font-medium">
|
| 152 |
+
https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine (opens in new tab)
|
| 153 |
+
</a>
|
| 154 |
+
</p>
|
| 155 |
+
<p class="text-xs text-gray-600">
|
| 156 |
+
Fully deployed to Google Cloud Run, accessible 24/7
|
| 157 |
</p>
|
| 158 |
</div>
|
| 159 |
</div>
|
| 160 |
</div>
|
| 161 |
+
|
| 162 |
+
<div class="grid md:grid-cols-2 lg:grid-cols-3 gap-6 mb-6">
|
| 163 |
+
<div class="bg-white rounded-lg p-6 card-hover">
|
| 164 |
+
<div class="text-3xl mb-3">π¬</div>
|
| 165 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-2">CopernicusAI</h3>
|
| 166 |
+
<p class="text-sm text-gray-600 mb-3">Core synthesis & distribution platform for AI-powered research and podcast generation</p>
|
| 167 |
+
<p class="text-xs text-gray-500 italic">This platform</p>
|
| 168 |
+
</div>
|
| 169 |
+
|
| 170 |
+
<div class="bg-white rounded-lg p-6 card-hover border-2 border-green-300">
|
| 171 |
+
<div class="text-3xl mb-3">πΊοΈ</div>
|
| 172 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-2">Knowledge Engine Dashboard</h3>
|
| 173 |
+
<p class="text-sm text-gray-600 mb-3">β
Fully operational web interface with knowledge graph, vector search, RAG queries, and content browsing</p>
|
| 174 |
+
<a href="https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine" target="_blank" class="text-xs text-blue-600 hover:underline">Live System β</a>
|
| 175 |
+
</div>
|
| 176 |
+
|
| 177 |
+
<div class="bg-white rounded-lg p-6 card-hover">
|
| 178 |
+
<div class="text-3xl mb-3">π οΈ</div>
|
| 179 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-2">Programming Framework</h3>
|
| 180 |
+
<p class="text-sm text-gray-600 mb-3">Foundational meta-tool for universal process analysis across disciplines</p>
|
| 181 |
+
<a href="https://huggingface.co/spaces/garywelz/programming_framework" target="_blank" class="text-xs text-blue-600 hover:underline">Explore β</a>
|
| 182 |
+
</div>
|
| 183 |
+
|
| 184 |
+
<div class="bg-white rounded-lg p-6 card-hover">
|
| 185 |
+
<div class="text-3xl mb-3">π§¬</div>
|
| 186 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-2">GLMP</h3>
|
| 187 |
+
<p class="text-sm text-gray-600 mb-3">Biological process visualization - 50+ processes mapped</p>
|
| 188 |
+
<a href="https://huggingface.co/spaces/garywelz/glmp" target="_blank" class="text-xs text-blue-600 hover:underline">Explore β</a>
|
| 189 |
+
</div>
|
| 190 |
+
|
| 191 |
+
<div class="bg-white rounded-lg p-6 card-hover">
|
| 192 |
+
<div class="text-3xl mb-3">π</div>
|
| 193 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-2">Metadata Database</h3>
|
| 194 |
+
<p class="text-sm text-gray-600 mb-3">Core data infrastructure for research paper metadata and citation networks</p>
|
| 195 |
+
<a href="https://huggingface.co/spaces/garywelz/metadata_database" target="_blank" class="text-xs text-blue-600 hover:underline">Explore β</a>
|
| 196 |
+
</div>
|
| 197 |
+
|
| 198 |
+
<div class="bg-white rounded-lg p-6 card-hover">
|
| 199 |
+
<div class="text-3xl mb-3">π¬</div>
|
| 200 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-2">Video Database</h3>
|
| 201 |
+
<p class="text-sm text-gray-600 mb-3">Multi-modal content with transcript-based search for scientific videos</p>
|
| 202 |
+
<a href="https://huggingface.co/spaces/garywelz/sciencevideodb" target="_blank" class="text-xs text-blue-600 hover:underline">Explore β</a>
|
| 203 |
+
</div>
|
| 204 |
+
|
| 205 |
+
<div class="bg-white rounded-lg p-6 card-hover border-2 border-dashed border-gray-300">
|
| 206 |
+
<div class="text-3xl mb-3 opacity-50">β</div>
|
| 207 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-2">Future Components</h3>
|
| 208 |
+
<p class="text-sm text-gray-600 mb-3">Additional tools, databases, and collaboration features will be added as the project develops</p>
|
| 209 |
+
</div>
|
| 210 |
+
</div>
|
| 211 |
</div>
|
| 212 |
</section>
|
| 213 |
|
| 214 |
+
<!-- Key Statistics -->
|
| 215 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 216 |
+
<div class="grid md:grid-cols-2 lg:grid-cols-4 gap-6 mb-12">
|
| 217 |
+
<div class="bg-white rounded-lg shadow-md p-6 text-center">
|
| 218 |
+
<div class="stat-number mb-2">23,246</div>
|
| 219 |
+
<div class="text-gray-600 font-semibold">Research Papers</div>
|
| 220 |
+
<div class="text-sm text-gray-500 mt-1">Indexed in Knowledge Engine (As of January 2025)</div>
|
| 221 |
+
</div>
|
| 222 |
+
<div class="bg-white rounded-lg shadow-md p-6 text-center">
|
| 223 |
+
<div class="stat-number mb-2">314</div>
|
| 224 |
+
<div class="text-gray-600 font-semibold">Processes</div>
|
| 225 |
+
<div class="text-sm text-gray-500 mt-1">Visualized across 6 databases (As of January 2025)</div>
|
| 226 |
+
</div>
|
| 227 |
+
<div class="bg-white rounded-lg shadow-md p-6 text-center">
|
| 228 |
+
<div class="stat-number mb-2">753</div>
|
| 229 |
+
<div class="text-gray-600 font-semibold">Videos</div>
|
| 230 |
+
<div class="text-sm text-gray-500 mt-1">Science videos indexed (As of January 2025)</div>
|
| 231 |
+
</div>
|
| 232 |
+
<div class="bg-white rounded-lg shadow-md p-6 text-center">
|
| 233 |
+
<div class="stat-number mb-2">79</div>
|
| 234 |
+
<div class="text-gray-600 font-semibold">Podcasts</div>
|
| 235 |
+
<div class="text-sm text-gray-500 mt-1">Generated across 5 disciplines (As of January 2025)</div>
|
| 236 |
+
</div>
|
| 237 |
+
</div>
|
| 238 |
+
</section>
|
| 239 |
|
| 240 |
+
<!-- Core Platform Capabilities -->
|
| 241 |
+
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 242 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-8 text-center">π Core Platform Capabilities</h2>
|
| 243 |
+
|
| 244 |
+
<div class="space-y-8">
|
| 245 |
+
<!-- AI Podcast Generation -->
|
| 246 |
+
<div class="bg-white rounded-xl shadow-lg p-8">
|
| 247 |
+
<div class="flex items-start mb-4">
|
| 248 |
+
<span class="text-4xl mr-4">ποΈ</span>
|
| 249 |
+
<div class="flex-1">
|
| 250 |
+
<h3 class="text-2xl font-bold text-gray-900 mb-3">AI-Powered Podcast Generation</h3>
|
| 251 |
+
<p class="text-gray-600 mb-4">
|
| 252 |
+
Collaborative research platform where subscribers prompt and generate multi-voice AI podcasts
|
| 253 |
+
(5-10 minutes) synthesizing research from multiple academic sources. Subscribers can share their
|
| 254 |
+
podcasts publicly or keep them private. Evidence-based content generation requiring minimum 3
|
| 255 |
+
research sources per episode.
|
| 256 |
+
</p>
|
| 257 |
+
<div class="grid md:grid-cols-2 gap-4 mt-4">
|
| 258 |
+
<div>
|
| 259 |
+
<h4 class="font-semibold text-gray-800 mb-2">Key Features:</h4>
|
| 260 |
+
<ul class="text-sm text-gray-600 space-y-1">
|
| 261 |
+
<li>β Comprehensive research integration (8+ databases)</li>
|
| 262 |
+
<li>β Professional multi-speaker dialogue</li>
|
| 263 |
+
<li>β AI-generated scientific visualizations</li>
|
| 264 |
+
<li>β RSS feed distribution</li>
|
| 265 |
+
<li>β Quality scoring & relevance ranking</li>
|
| 266 |
+
<li>β Paradigm shift identification</li>
|
| 267 |
+
</ul>
|
| 268 |
+
</div>
|
| 269 |
+
<div>
|
| 270 |
+
<h4 class="font-semibold text-gray-800 mb-2">Research Integration:</h4>
|
| 271 |
+
<ul class="text-sm text-gray-600 space-y-1">
|
| 272 |
+
<li>β Real-time discovery from 8+ APIs</li>
|
| 273 |
+
<li>β Parallel search across databases</li>
|
| 274 |
+
<li>β Automatic citation extraction</li>
|
| 275 |
+
<li>β Source validation & verification</li>
|
| 276 |
+
<li>β Interdisciplinary connection analysis</li>
|
| 277 |
+
</ul>
|
| 278 |
+
</div>
|
| 279 |
+
</div>
|
| 280 |
+
</div>
|
| 281 |
</div>
|
| 282 |
+
</div>
|
| 283 |
|
| 284 |
+
<!-- LLM Integration -->
|
| 285 |
+
<div class="bg-white rounded-xl shadow-lg p-8">
|
| 286 |
+
<div class="flex items-start mb-4">
|
| 287 |
+
<span class="text-4xl mr-4">π€</span>
|
| 288 |
+
<div class="flex-1">
|
| 289 |
+
<h3 class="text-2xl font-bold text-gray-900 mb-3">Advanced LLM Integration</h3>
|
| 290 |
+
<p class="text-gray-600 mb-4">Multi-model architecture with intelligent model selection:</p>
|
| 291 |
+
<div class="grid md:grid-cols-2 gap-4">
|
| 292 |
+
<div>
|
| 293 |
+
<h4 class="font-semibold text-gray-800 mb-2">Primary Models:</h4>
|
| 294 |
+
<ul class="text-sm text-gray-600 space-y-1">
|
| 295 |
+
<li>β’ <strong>Google Gemini 3</strong> - Latest research analysis and content generation</li>
|
| 296 |
+
<li>β’ <strong>OpenAI GPT-4/GPT-3.5</strong> - Content synthesis and quality validation</li>
|
| 297 |
+
<li>β’ <strong>Anthropic Claude 3</strong> (Sonnet, Haiku) - Alternative reasoning paths</li>
|
| 298 |
+
<li>β’ <strong>ElevenLabs TTS</strong> - Multi-voice text-to-speech synthesis</li>
|
| 299 |
+
</ul>
|
| 300 |
+
</div>
|
| 301 |
+
<div>
|
| 302 |
+
<h4 class="font-semibold text-gray-800 mb-2">Capabilities:</h4>
|
| 303 |
+
<ul class="text-sm text-gray-600 space-y-1">
|
| 304 |
+
<li>β’ Multi-paper analysis & synthesis</li>
|
| 305 |
+
<li>β’ Paradigm shift detection</li>
|
| 306 |
+
<li>β’ Entity extraction (genes, proteins, compounds)</li>
|
| 307 |
+
<li>β’ Citation tracking & cross-references</li>
|
| 308 |
+
<li>β’ Content quality scoring</li>
|
| 309 |
+
</ul>
|
| 310 |
+
</div>
|
| 311 |
+
</div>
|
| 312 |
+
</div>
|
| 313 |
</div>
|
| 314 |
+
</div>
|
| 315 |
|
| 316 |
+
<!-- Research Resources -->
|
| 317 |
+
<div class="bg-white rounded-xl shadow-lg p-8">
|
| 318 |
+
<div class="flex items-start mb-4">
|
| 319 |
+
<span class="text-4xl mr-4">π</span>
|
| 320 |
+
<div class="flex-1">
|
| 321 |
+
<h3 class="text-2xl font-bold text-gray-900 mb-3">Research Resource Access</h3>
|
| 322 |
+
<p class="text-gray-600 mb-4">
|
| 323 |
+
Comprehensive academic database coverage with <strong>250+ million research papers</strong> accessible
|
| 324 |
+
through integrated APIs.
|
| 325 |
+
</p>
|
| 326 |
+
<div class="grid md:grid-cols-2 gap-4">
|
| 327 |
+
<div>
|
| 328 |
+
<h4 class="font-semibold text-gray-800 mb-2">Academic Databases:</h4>
|
| 329 |
+
<ul class="text-sm text-gray-600 space-y-1">
|
| 330 |
+
<li>β’ PubMed/NCBI (~30+ million papers)</li>
|
| 331 |
+
<li>β’ arXiv (~2+ million preprints)</li>
|
| 332 |
+
<li>β’ NASA ADS (~15+ million papers)</li>
|
| 333 |
+
<li>β’ Zenodo (100K+ datasets)</li>
|
| 334 |
+
<li>β’ bioRxiv/medRxiv (preprints)</li>
|
| 335 |
+
<li>β’ CORE (~200+ million papers)</li>
|
| 336 |
+
<li>β’ Google Scholar (comprehensive)</li>
|
| 337 |
+
<li>β’ News API (current events)</li>
|
| 338 |
+
<li>β’ YouTube Data API (academic videos)</li>
|
| 339 |
+
</ul>
|
| 340 |
+
</div>
|
| 341 |
+
</div>
|
| 342 |
+
</div>
|
| 343 |
</div>
|
| 344 |
</div>
|
| 345 |
|
| 346 |
+
<!-- Audio and Video Podcast Production -->
|
| 347 |
+
<div class="bg-white rounded-xl shadow-lg p-8">
|
| 348 |
+
<div class="flex items-start mb-4">
|
| 349 |
+
<span class="text-4xl mr-4">ποΈ</span>
|
| 350 |
+
<div class="flex-1">
|
| 351 |
+
<h3 class="text-2xl font-bold text-gray-900 mb-3">Audio and Video Podcast Production</h3>
|
| 352 |
+
<p class="text-gray-600 mb-4">
|
| 353 |
+
<strong>Operating Audio Podcast System:</strong> Full production and distribution platform for subscriber-generated
|
| 354 |
+
podcasts. Users can prompt, generate, publish, and distribute audio podcasts with RSS feed support for
|
| 355 |
+
Spotify, Apple Podcasts, and Google Podcasts.
|
| 356 |
+
</p>
|
| 357 |
+
<div class="bg-green-50 rounded-lg p-4 mb-4">
|
| 358 |
+
<h4 class="font-semibold text-gray-800 mb-2">Current Audio Capabilities (Operational):</h4>
|
| 359 |
+
<ul class="text-sm text-gray-700 space-y-1">
|
| 360 |
+
<li>β Multi-voice AI podcast generation</li>
|
| 361 |
+
<li>β Research-driven content creation</li>
|
| 362 |
+
<li>β RSS feed distribution</li>
|
| 363 |
+
<li>β Public and private podcast options</li>
|
| 364 |
+
<li>β Professional audio quality</li>
|
| 365 |
+
</ul>
|
| 366 |
+
</div>
|
| 367 |
+
<div class="bg-blue-50 rounded-lg p-4 mt-4">
|
| 368 |
+
<h4 class="font-semibold text-gray-800 mb-2">Video Production (Future - Phase 2+):</h4>
|
| 369 |
+
<p class="text-sm text-gray-700 mb-2">Advanced video features planned for future development:</p>
|
| 370 |
+
<ul class="text-sm text-gray-700 space-y-2">
|
| 371 |
+
<li>β’ <strong>Visual Content Integration:</strong> Automated extraction from papers, web scraping, JSON database integration</li>
|
| 372 |
+
<li>β’ <strong>Dynamic Visualizations:</strong> Scientific animations, real-time charts, LaTeX rendering</li>
|
| 373 |
+
<li>β’ <strong>External Video Quoting:</strong> YouTube segment extraction with attribution & fair use compliance</li>
|
| 374 |
+
<li>β’ <strong>Advanced Composition:</strong> Multi-layer video, auto subtitles, text overlays, professional transitions</li>
|
| 375 |
+
</ul>
|
| 376 |
+
<p class="text-xs text-gray-600 mt-2">
|
| 377 |
+
See: <a href="https://huggingface.co/spaces/garywelz/sciencevideodb" class="text-blue-600 hover:underline">Science Video Database</a> - Companion project for research video content management.
|
| 378 |
+
</p>
|
| 379 |
+
</div>
|
| 380 |
+
</div>
|
| 381 |
</div>
|
| 382 |
</div>
|
| 383 |
|
| 384 |
+
<!-- Research Papers Metadata Database -->
|
| 385 |
+
<div class="bg-white rounded-xl shadow-lg p-8">
|
| 386 |
+
<div class="flex items-start mb-4">
|
| 387 |
+
<span class="text-4xl mr-4">π</span>
|
| 388 |
+
<div class="flex-1">
|
| 389 |
+
<h3 class="text-2xl font-bold text-gray-900 mb-3">Research Papers Metadata Database (Phase 2)</h3>
|
| 390 |
+
<p class="text-gray-600 mb-4">
|
| 391 |
+
A centralized <strong>metadata repository</strong> (not a file archive) providing structured JSON objects
|
| 392 |
+
with AI-powered preprocessing.
|
| 393 |
+
</p>
|
| 394 |
+
<div class="grid md:grid-cols-2 gap-4">
|
| 395 |
+
<div>
|
| 396 |
+
<h4 class="font-semibold text-gray-800 mb-2">Structured JSON Objects:</h4>
|
| 397 |
+
<ul class="text-sm text-gray-600 space-y-1">
|
| 398 |
+
<li>β’ DOI, arXiv ID, publication info</li>
|
| 399 |
+
<li>β’ Abstracts & key findings</li>
|
| 400 |
+
<li>β’ Extracted entities (genes, proteins, compounds, equations)</li>
|
| 401 |
+
<li>β’ Citation networks & cross-references</li>
|
| 402 |
+
<li>β’ Paradigm shift indicators</li>
|
| 403 |
+
<li>β’ Quality scores & relevance metrics</li>
|
| 404 |
+
</ul>
|
| 405 |
+
</div>
|
| 406 |
+
<div>
|
| 407 |
+
<h4 class="font-semibold text-gray-800 mb-2">AI-Powered Preprocessing:</h4>
|
| 408 |
+
<ul class="text-sm text-gray-600 space-y-1">
|
| 409 |
+
<li>β’ LLM-based entity extraction</li>
|
| 410 |
+
<li>β’ Automatic categorization</li>
|
| 411 |
+
<li>β’ Keyword extraction & semantic tagging</li>
|
| 412 |
+
<li>β’ Citation tracking & mapping</li>
|
| 413 |
+
<li>β’ Quality assessment</li>
|
| 414 |
+
<li>β’ RESTful API access</li>
|
| 415 |
+
</ul>
|
| 416 |
+
</div>
|
| 417 |
+
</div>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 418 |
</div>
|
| 419 |
</div>
|
| 420 |
</div>
|
| 421 |
</div>
|
| 422 |
</section>
|
| 423 |
|
| 424 |
+
<!-- Methodological Details -->
|
| 425 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 426 |
+
<div class="bg-white rounded-xl shadow-lg p-8 mb-8">
|
| 427 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-6">π¬ Methodology & System Design</h2>
|
| 428 |
+
|
| 429 |
+
<div class="grid md:grid-cols-2 gap-6 mb-6">
|
| 430 |
+
<div class="bg-blue-50 rounded-lg p-6">
|
| 431 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-4">Multi-Source Validation Process</h3>
|
| 432 |
+
<p class="text-gray-700 mb-3">
|
| 433 |
+
The system requires a <strong>minimum of 3 research sources</strong> per podcast episode. Each source is:
|
| 434 |
+
</p>
|
| 435 |
+
<ul class="text-sm text-gray-700 space-y-2">
|
| 436 |
+
<li>β’ Retrieved from authoritative academic databases (PubMed, arXiv, NASA ADS, etc.)</li>
|
| 437 |
+
<li>β’ Validated for authenticity and publication status</li>
|
| 438 |
+
<li>β’ Scored for quality and relevance to the research topic</li>
|
| 439 |
+
<li>β’ Cross-referenced to verify consistency and eliminate conflicting information</li>
|
| 440 |
+
<li>β’ Processed through parallel API queries for comprehensive coverage</li>
|
| 441 |
+
</ul>
|
| 442 |
+
</div>
|
| 443 |
+
|
| 444 |
+
<div class="bg-green-50 rounded-lg p-6">
|
| 445 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-4">Quality Assurance Mechanisms</h3>
|
| 446 |
+
<ul class="text-sm text-gray-700 space-y-2">
|
| 447 |
+
<li>β’ <strong>Source Verification:</strong> Automated checking of DOI, arXiv IDs, and publication metadata</li>
|
| 448 |
+
<li>β’ <strong>Relevance Scoring:</strong> LLM-based assessment of paper relevance to query</li>
|
| 449 |
+
<li>β’ <strong>Paradigm Shift Detection:</strong> Identification of revolutionary vs. incremental research</li>
|
| 450 |
+
<li>β’ <strong>Citation Extraction:</strong> Automatic extraction and formatting of citations</li>
|
| 451 |
+
<li>β’ <strong>Content Validation:</strong> Multi-model verification (Gemini, GPT-4, Claude) for accuracy</li>
|
| 452 |
+
</ul>
|
| 453 |
+
</div>
|
| 454 |
</div>
|
| 455 |
+
|
| 456 |
+
<div class="bg-purple-50 rounded-lg p-6 mb-6">
|
| 457 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-4">Citation Extraction & Verification</h3>
|
| 458 |
+
<p class="text-gray-700 mb-3">
|
| 459 |
+
The system automatically extracts and formats citations from research papers:
|
|
|
|
|
|
|
| 460 |
</p>
|
| 461 |
+
<ul class="text-sm text-gray-700 space-y-2">
|
| 462 |
+
<li>β’ DOI resolution and metadata enrichment</li>
|
| 463 |
+
<li>β’ arXiv ID parsing and preprint identification</li>
|
| 464 |
+
<li>β’ Author, title, and publication information extraction</li>
|
| 465 |
+
<li>β’ Cross-reference linking between related papers</li>
|
| 466 |
+
<li>β’ Citation network analysis for relationship mapping</li>
|
| 467 |
+
</ul>
|
| 468 |
</div>
|
| 469 |
+
|
| 470 |
+
<div class="bg-orange-50 rounded-lg p-6">
|
| 471 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-4">Paradigm Shift Detection Implementation</h3>
|
| 472 |
+
<p class="text-gray-700 mb-3">
|
| 473 |
+
The system uses LLM analysis to identify paradigm-shifting research by:
|
|
|
|
|
|
|
| 474 |
</p>
|
| 475 |
+
<ul class="text-sm text-gray-700 space-y-2">
|
| 476 |
+
<li>β’ Analyzing citation patterns and impact metrics</li>
|
| 477 |
+
<li>β’ Detecting novel methodologies or breakthrough discoveries</li>
|
| 478 |
+
<li>β’ Comparing against established knowledge frameworks</li>
|
| 479 |
+
<li>β’ Identifying interdisciplinary connections and cross-domain insights</li>
|
| 480 |
+
<li>β’ Flagging research that challenges existing paradigms</li>
|
| 481 |
+
</ul>
|
| 482 |
</div>
|
| 483 |
</div>
|
| 484 |
</section>
|
| 485 |
|
| 486 |
+
<!-- Technology Stack -->
|
|
|
|
| 487 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 488 |
+
<div class="bg-white rounded-xl shadow-lg p-8">
|
| 489 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-6">βοΈ Technology Stack</h2>
|
|
|
|
|
|
|
|
|
|
| 490 |
|
| 491 |
+
<div class="grid md:grid-cols-3 gap-6 mb-6">
|
| 492 |
+
<div>
|
| 493 |
+
<h3 class="text-lg font-semibold text-gray-800 mb-3">AI & Machine Learning</h3>
|
| 494 |
+
<ul class="text-sm text-gray-600 space-y-1">
|
| 495 |
+
<li>β’ Google Gemini 3</li>
|
| 496 |
+
<li>β’ Google Vertex AI (model orchestration)</li>
|
| 497 |
+
<li>β’ OpenAI GPT-4/GPT-3.5</li>
|
| 498 |
+
<li>β’ Anthropic Claude 3</li>
|
| 499 |
+
<li>β’ ElevenLabs TTS</li>
|
| 500 |
+
<li>β’ DALL-E 3</li>
|
| 501 |
+
<li>β’ Cloud Vision API</li>
|
| 502 |
+
<li>β’ Video Intelligence API</li>
|
| 503 |
+
</ul>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 504 |
</div>
|
| 505 |
+
|
| 506 |
+
<div>
|
| 507 |
+
<h3 class="text-lg font-semibold text-gray-800 mb-3">Backend Infrastructure</h3>
|
| 508 |
+
<ul class="text-sm text-gray-600 space-y-1">
|
| 509 |
+
<li>β’ FastAPI (Python)</li>
|
| 510 |
+
<li>β’ Google Cloud Run</li>
|
| 511 |
+
<li>β’ Firestore (NoSQL)</li>
|
| 512 |
+
<li>β’ Cloud Storage</li>
|
| 513 |
+
<li>β’ Cloud Functions</li>
|
| 514 |
+
<li>β’ Cloud Tasks</li>
|
| 515 |
+
<li>β’ Secret Manager</li>
|
| 516 |
+
</ul>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 517 |
</div>
|
| 518 |
+
|
| 519 |
+
<div>
|
| 520 |
+
<h3 class="text-lg font-semibold text-gray-800 mb-3">Frontend</h3>
|
| 521 |
+
<ul class="text-sm text-gray-600 space-y-1">
|
| 522 |
+
<li>β’ Next.js 15.5.7</li>
|
| 523 |
+
<li>β’ Alpine.js</li>
|
| 524 |
+
<li>β’ Tailwind CSS</li>
|
| 525 |
+
<li>β’ Vercel</li>
|
| 526 |
+
</ul>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 527 |
</div>
|
| 528 |
</div>
|
| 529 |
</div>
|
| 530 |
</section>
|
| 531 |
|
| 532 |
+
<!-- Limitations & Future Work -->
|
| 533 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 534 |
+
<div class="bg-white rounded-xl shadow-lg p-8 mb-8">
|
| 535 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-6">π Limitations & Future Directions</h2>
|
| 536 |
|
| 537 |
+
<div class="grid md:grid-cols-2 gap-6">
|
| 538 |
+
<div class="bg-yellow-50 rounded-lg p-6 border-l-4 border-yellow-400">
|
| 539 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-4">Current Limitations</h3>
|
| 540 |
+
<ul class="text-sm text-gray-700 space-y-2">
|
| 541 |
+
<li>β’ <strong>Discipline Coverage:</strong> Currently indexing 23,246 papers across multiple disciplines; expansion to additional disciplines in progress</li>
|
| 542 |
+
<li>β’ <strong>Source Bias:</strong> Coverage depends on database API availability and open access policies</li>
|
| 543 |
+
<li>β’ <strong>LLM Accuracy:</strong> Content generation relies on LLM accuracy; multi-source validation mitigates but doesn't eliminate errors</li>
|
| 544 |
+
<li>β’ <strong>Real-Time Updates:</strong> Knowledge graph updates require manual or scheduled processing cycles</li>
|
| 545 |
+
<li>β’ <strong>Language:</strong> Currently optimized for English-language research papers</li>
|
| 546 |
</ul>
|
| 547 |
</div>
|
| 548 |
+
|
| 549 |
+
<div class="bg-blue-50 rounded-lg p-6 border-l-4 border-blue-400">
|
| 550 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-4">Future Development</h3>
|
| 551 |
+
<ul class="text-sm text-gray-700 space-y-2">
|
| 552 |
+
<li>β’ <strong>Multi-Discipline Expansion:</strong> Expanding knowledge graph to Biology, Chemistry, Physics, Computer Science</li>
|
| 553 |
+
<li>β’ <strong>Process Databases:</strong> Creating comprehensive flowchart databases for all 5 disciplines (~50 processes each)</li>
|
| 554 |
+
<li>β’ <strong>Advanced Video Features:</strong> Dynamic visualizations, animations, and multi-layer composition</li>
|
| 555 |
+
<li>β’ <strong>Multi-Language Support:</strong> Extending to non-English research papers</li>
|
| 556 |
+
<li>β’ <strong>Enhanced Validation:</strong> Peer review mechanisms and user feedback integration</li>
|
| 557 |
+
<li>β’ <strong>Real-Time Updates:</strong> Automated continuous knowledge graph updates</li>
|
| 558 |
</ul>
|
| 559 |
</div>
|
| 560 |
+
</div>
|
| 561 |
+
</div>
|
| 562 |
+
</section>
|
| 563 |
|
| 564 |
+
<!-- Research & Collaborative Tools -->
|
| 565 |
+
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 566 |
+
<div class="bg-gradient-to-r from-green-50 to-blue-50 rounded-xl p-8">
|
| 567 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-6">π¬ Collaborative Research Tools</h2>
|
| 568 |
+
|
| 569 |
+
<div class="grid md:grid-cols-2 gap-6 mb-6">
|
| 570 |
<div>
|
| 571 |
+
<h3 class="text-xl font-semibold text-gray-800 mb-3">Collaborative Research Tools</h3>
|
| 572 |
+
<p class="text-gray-700 mb-3">
|
| 573 |
+
These platforms enable collective participation and collaboration across diverse user communities:
|
| 574 |
+
</p>
|
| 575 |
+
<ul class="text-gray-700 space-y-2">
|
| 576 |
+
<li>β’ <strong>Researchers</strong> - Tools for hypothesis formation and testing, cross-disciplinary synthesis</li>
|
| 577 |
+
<li>β’ <strong>Collaborators</strong> - Collective knowledge exploration and refinement</li>
|
| 578 |
+
<li>β’ <strong>Subscribers</strong> - Prompt, generate, and share podcasts (public or private)</li>
|
| 579 |
+
<li>β’ <strong>Community</strong> - User suggestions, comments, and collaborative flowchart improvement (GLMP)</li>
|
| 580 |
</ul>
|
| 581 |
+
<p class="text-gray-600 mt-4 italic">
|
| 582 |
+
Like a microscope enables observation of the microscopic world, these tools enable observation and
|
| 583 |
+
exploration of humanity's collective knowledge.
|
| 584 |
+
</p>
|
| 585 |
</div>
|
| 586 |
+
|
| 587 |
<div>
|
| 588 |
+
<h3 class="text-xl font-semibold text-gray-800 mb-3">Key Innovations</h3>
|
| 589 |
+
<ul class="text-gray-700 space-y-2">
|
| 590 |
+
<li>β’ Multi-source validation (min 3 sources)</li>
|
| 591 |
+
<li>β’ Evidence-based generation</li>
|
| 592 |
+
<li>β’ Paradigm shift detection</li>
|
| 593 |
+
<li>β’ Interdisciplinary connections</li>
|
| 594 |
+
<li>β’ Multiple expertise levels</li>
|
| 595 |
+
<li>β’ Full citation tracking</li>
|
| 596 |
</ul>
|
| 597 |
</div>
|
| 598 |
</div>
|
| 599 |
</div>
|
| 600 |
</section>
|
| 601 |
|
| 602 |
+
<!-- Prior Work & Research Contributions -->
|
| 603 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 604 |
+
<div class="bg-gradient-to-r from-purple-50 to-blue-50 rounded-xl shadow-lg p-8">
|
| 605 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-6">π Prior Work & Research Contributions</h2>
|
| 606 |
+
|
| 607 |
+
<div class="bg-white rounded-lg p-6 mb-6">
|
| 608 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-4">Overview</h3>
|
| 609 |
+
<p class="text-gray-700 mb-4">
|
| 610 |
+
This platform represents <strong>prior work</strong> that demonstrates foundational research and development
|
| 611 |
+
achievements in AI-powered scientific knowledge synthesis, collaborative research tools, and multi-modal content
|
| 612 |
+
generation. These contributions establish the technical foundation and proof-of-concept for the broader
|
| 613 |
+
<strong>CopernicusAI Knowledge Engine</strong> initiative.
|
| 614 |
+
</p>
|
| 615 |
+
</div>
|
| 616 |
+
|
| 617 |
+
<div class="grid md:grid-cols-2 gap-6 mb-6">
|
| 618 |
+
<div class="bg-white rounded-lg p-6">
|
| 619 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-3">π¬ Research Contributions</h3>
|
| 620 |
+
<ul class="text-sm text-gray-700 space-y-2">
|
| 621 |
+
<li>β’ <strong>AI-Powered Research Synthesis:</strong> Production system for multi-source research synthesis using LLMs</li>
|
| 622 |
+
<li>β’ <strong>Multi-Model Architecture:</strong> Intelligent model selection with Gemini 3, GPT-4, Claude 3</li>
|
| 623 |
+
<li>β’ <strong>Collaborative Platform:</strong> Subscriber-driven content generation with public/private sharing</li>
|
| 624 |
+
<li>β’ <strong>Knowledge Engine Integration:</strong> Architecture for Research Papers DB, Video DB, GLMP, Framework</li>
|
| 625 |
+
</ul>
|
| 626 |
+
</div>
|
| 627 |
+
|
| 628 |
+
<div class="bg-white rounded-lg p-6">
|
| 629 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-3">βοΈ Technical Achievements</h3>
|
| 630 |
+
<ul class="text-sm text-gray-700 space-y-2">
|
| 631 |
+
<li>β’ <strong>250+ Million Papers:</strong> Accessible via 8+ integrated academic databases</li>
|
| 632 |
+
<li>β’ <strong>79 Episodes:</strong> Generated across 5 scientific disciplines</li>
|
| 633 |
+
<li>β’ <strong>Production Deployment:</strong> Live platform with operational API and RSS distribution</li>
|
| 634 |
+
<li>β’ <strong>Scalable Architecture:</strong> Serverless microservices on Google Cloud</li>
|
| 635 |
+
</ul>
|
| 636 |
+
</div>
|
| 637 |
+
</div>
|
| 638 |
+
|
| 639 |
+
<div class="bg-white rounded-lg p-6 mb-6">
|
| 640 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-3">π― Position Within CopernicusAI Knowledge Engine</h3>
|
| 641 |
+
<p class="text-gray-700 mb-3">
|
| 642 |
+
This platform serves as the <strong>core synthesis and distribution component</strong> of the CopernicusAI Knowledge Engine.
|
| 643 |
+
The Knowledge Engine is an integrated ecosystem of research and collaboration tools that work together to assist scientists
|
| 644 |
+
in their workflow, from research discovery through knowledge synthesis to multi-format content generation.
|
| 645 |
</p>
|
| 646 |
+
<div class="bg-blue-50 rounded-lg p-4 mb-3">
|
| 647 |
+
<h4 class="font-semibold text-gray-900 mb-2">Current Components:</h4>
|
| 648 |
+
<div class="grid md:grid-cols-2 gap-4 text-sm">
|
| 649 |
+
<ul class="text-gray-700 space-y-1">
|
| 650 |
+
<li>1. <strong>CopernicusAI</strong> (This platform) - Core synthesis & distribution</li>
|
| 651 |
+
<li>2. <strong>Programming Framework</strong> - Foundational meta-tool</li>
|
| 652 |
+
<li>3. <strong>GLMP</strong> - Biological process visualization</li>
|
| 653 |
+
</ul>
|
| 654 |
+
<ul class="text-gray-700 space-y-1">
|
| 655 |
+
<li>4. <strong>Research Paper Metadata Database</strong> - Data infrastructure</li>
|
| 656 |
+
<li>5. <strong>Science Video Database</strong> - Multi-modal content</li>
|
| 657 |
+
</ul>
|
| 658 |
+
</div>
|
| 659 |
+
</div>
|
| 660 |
+
<div class="bg-purple-50 rounded-lg p-4">
|
| 661 |
+
<h4 class="font-semibold text-gray-900 mb-2">Future Development:</h4>
|
| 662 |
+
<p class="text-gray-700 text-sm">
|
| 663 |
+
The Knowledge Engine is designed to grow and evolve. Additional tools, databases, and collaboration components
|
| 664 |
+
will be added as the project develops, expanding capabilities for AI-assisted scientific research and knowledge discovery.
|
| 665 |
+
</p>
|
| 666 |
+
</div>
|
| 667 |
</div>
|
| 668 |
|
| 669 |
+
<div class="bg-blue-50 rounded-lg p-6">
|
| 670 |
+
<h3 class="text-lg font-semibold text-gray-900 mb-3">π Citation Information</h3>
|
| 671 |
+
<p class="text-sm text-gray-700 mb-3">
|
| 672 |
+
<strong>For Grant Proposals (NSF/DOE):</strong>
|
|
|
|
| 673 |
</p>
|
| 674 |
+
<div class="bg-white rounded p-4 font-mono text-sm text-gray-800 mb-4">
|
| 675 |
+
<p class="mb-2">Welz, G. (2025). CopernicusAI: Knowledge Engine for Scientific Discovery.</p>
|
| 676 |
+
<p class="mb-2">Hugging Face Space. https://huggingface.co/spaces/garywelz/copernicusai</p>
|
| 677 |
+
<p>Live Platform: https://www.copernicusai.fyi</p>
|
| 678 |
+
</div>
|
| 679 |
+
<div class="bg-white rounded p-4 mb-4">
|
| 680 |
+
<p class="text-sm font-semibold text-gray-700 mb-2">BibTeX Format:</p>
|
| 681 |
+
<pre class="bg-gray-900 text-green-400 p-3 rounded text-xs overflow-x-auto"><code>@misc{welz2025copernicusai,
|
| 682 |
+
title={CopernicusAI: Knowledge Engine for Scientific Discovery},
|
| 683 |
+
author={Welz, Gary},
|
| 684 |
+
year={2025},
|
| 685 |
+
url={https://huggingface.co/spaces/garywelz/copernicusai},
|
| 686 |
+
note={Hugging Face Space, Live Platform: https://www.copernicusai.fyi}
|
| 687 |
+
}</code></pre>
|
| 688 |
+
</div>
|
| 689 |
+
</div>
|
| 690 |
+
</div>
|
| 691 |
+
</section>
|
| 692 |
+
|
| 693 |
+
<!-- Data Availability Statement -->
|
| 694 |
+
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 695 |
+
<div class="bg-white rounded-xl shadow-lg p-8 mb-8">
|
| 696 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-6">π Data Availability Statement</h2>
|
| 697 |
+
|
| 698 |
+
<div class="bg-blue-50 rounded-lg p-6 mb-4">
|
| 699 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-4">Platform Access</h3>
|
| 700 |
+
<ul class="text-gray-700 space-y-2">
|
| 701 |
+
<li>β’ <strong>Live Platform:</strong> <a href="https://www.copernicusai.fyi" target="_blank" class="text-blue-600 hover:underline">https://www.copernicusai.fyi</a> (opens in new tab)</li>
|
| 702 |
+
<li>β’ <strong>Knowledge Engine Dashboard:</strong> <a href="https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine" target="_blank" class="text-blue-600 hover:underline">https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine</a> (opens in new tab)</li>
|
| 703 |
+
<li>β’ <strong>API Base URL:</strong> <code class="bg-gray-100 px-2 py-1 rounded">https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app</code></li>
|
| 704 |
+
<li>β’ <strong>RSS Feed:</strong> <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/feeds/copernicus-mvp-rss-feed.xml" target="_blank" class="text-blue-600 hover:underline">Available for public access</a> (opens in new tab)</li>
|
| 705 |
+
</ul>
|
| 706 |
+
</div>
|
| 707 |
+
|
| 708 |
+
<div class="bg-green-50 rounded-lg p-6 mb-4">
|
| 709 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-4">Data & Code Availability</h3>
|
| 710 |
+
<ul class="text-gray-700 space-y-2">
|
| 711 |
+
<li>β’ <strong>Hugging Face Spaces:</strong> All components accessible at <a href="https://huggingface.co/garywelz" target="_blank" class="text-blue-600 hover:underline">https://huggingface.co/garywelz</a> (opens in new tab)</li>
|
| 712 |
+
<li>β’ <strong>Process Flowcharts (GLMP):</strong> JSON files stored in Google Cloud Storage, accessible via <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/glmp-database-table.html" target="_blank" class="text-blue-600 hover:underline">GLMP Database Table</a> (opens in new tab)</li>
|
| 713 |
+
<li>β’ <strong>Research Paper Metadata:</strong> 23,246 indexed papers with metadata accessible through Knowledge Engine Dashboard</li>
|
| 714 |
+
<li>β’ <strong>API Documentation:</strong> RESTful API endpoints available for programmatic access (see API Documentation section)</li>
|
| 715 |
+
</ul>
|
| 716 |
+
</div>
|
| 717 |
+
|
| 718 |
+
<div class="bg-purple-50 rounded-lg p-6">
|
| 719 |
+
<h3 class="text-xl font-semibold text-gray-900 mb-4">Reproducibility Information</h3>
|
| 720 |
+
<ul class="text-gray-700 space-y-2">
|
| 721 |
+
<li>β’ <strong>Technology Stack:</strong> All technologies and versions documented in Technology Stack section</li>
|
| 722 |
+
<li>β’ <strong>LLM Models:</strong> Google Gemini 3, OpenAI GPT-4/GPT-3.5, Anthropic Claude 3 (versions specified in documentation)</li>
|
| 723 |
+
<li>β’ <strong>Source Citations:</strong> All podcast episodes include full citations to source papers</li>
|
| 724 |
+
<li>β’ <strong>Metadata:</strong> Complete metadata for all generated content available through API</li>
|
| 725 |
+
<li>β’ <strong>License:</strong> MIT License - see license information in space metadata</li>
|
| 726 |
+
</ul>
|
| 727 |
</div>
|
| 728 |
</div>
|
| 729 |
</section>
|
|
|
|
| 732 |
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 733 |
<div class="bg-white rounded-xl shadow-lg p-8">
|
| 734 |
<h2 class="text-3xl font-bold text-gray-900 mb-6">How to Cite This Work</h2>
|
| 735 |
+
<div class="bg-gray-50 rounded-lg p-6 mb-4">
|
| 736 |
+
<p class="text-gray-800 font-mono text-lg leading-relaxed mb-4">
|
| 737 |
+
Welz, G. (2024β2025). <em>CopernicusAI: AI-Generated Audio Briefings as a Research Interface</em>.<br>
|
| 738 |
+
Hugging Face Spaces. https://huggingface.co/spaces/garywelz/copernicusai
|
| 739 |
+
</p>
|
| 740 |
+
|
| 741 |
+
<div class="border-t border-gray-300 pt-4 mt-4">
|
| 742 |
+
<p class="text-sm font-semibold text-gray-700 mb-2">BibTeX Format:</p>
|
| 743 |
+
<pre class="bg-gray-800 text-green-400 p-4 rounded text-sm overflow-x-auto"><code>@misc{welz2025copernicusai,
|
| 744 |
+
title={CopernicusAI: AI-Generated Audio Briefings as a Research Interface},
|
| 745 |
author={Welz, Gary},
|
| 746 |
year={2024--2025},
|
| 747 |
+
url={https://huggingface.co/spaces/garywelz/copernicusai},
|
| 748 |
+
note={Hugging Face Space}
|
| 749 |
}</code></pre>
|
| 750 |
+
</div>
|
| 751 |
+
</div>
|
| 752 |
+
</div>
|
| 753 |
+
</section>
|
| 754 |
+
|
| 755 |
+
<!-- Grant Support & Collaboration -->
|
| 756 |
+
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 757 |
+
<div class="bg-white rounded-xl shadow-lg p-8">
|
| 758 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-6">π Grant Support & Collaboration</h2>
|
| 759 |
+
|
| 760 |
+
<div class="mb-6">
|
| 761 |
+
<h3 class="text-xl font-semibold text-gray-800 mb-3">Grant Applications Supported</h3>
|
| 762 |
+
<p class="text-gray-700 mb-4">
|
| 763 |
+
This platform is designed to support grant applications to:
|
| 764 |
+
</p>
|
| 765 |
+
<div class="grid md:grid-cols-3 gap-4">
|
| 766 |
+
<div class="bg-blue-50 rounded-lg p-4">
|
| 767 |
+
<h4 class="font-semibold text-gray-800 mb-2">NSF</h4>
|
| 768 |
+
<p class="text-sm text-gray-600">National Science Foundation - Science education and research infrastructure</p>
|
| 769 |
+
</div>
|
| 770 |
+
<div class="bg-green-50 rounded-lg p-4">
|
| 771 |
+
<h4 class="font-semibold text-gray-800 mb-2">DOE</h4>
|
| 772 |
+
<p class="text-sm text-gray-600">Department of Energy - Scientific computing and data science</p>
|
| 773 |
+
</div>
|
| 774 |
+
<div class="bg-purple-50 rounded-lg p-4">
|
| 775 |
+
<h4 class="font-semibold text-gray-800 mb-2">SAIR Foundation</h4>
|
| 776 |
+
<p class="text-sm text-gray-600">AI research and development initiatives</p>
|
| 777 |
</div>
|
| 778 |
</div>
|
| 779 |
+
</div>
|
| 780 |
+
|
| 781 |
+
<div>
|
| 782 |
+
<h3 class="text-xl font-semibold text-gray-800 mb-3">Collaboration Opportunities</h3>
|
| 783 |
+
<ul class="text-gray-700 space-y-2">
|
| 784 |
+
<li>β’ Integration with academic institutions</li>
|
| 785 |
+
<li>β’ Partnership with research organizations</li>
|
| 786 |
+
<li>β’ Open data initiatives</li>
|
| 787 |
+
<li>β’ Educational program development</li>
|
| 788 |
+
</ul>
|
| 789 |
+
</div>
|
| 790 |
+
</div>
|
| 791 |
+
</section>
|
| 792 |
+
|
| 793 |
+
<!-- Links & Resources -->
|
| 794 |
+
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
|
| 795 |
+
<div class="bg-gradient-to-r from-blue-50 to-purple-50 rounded-xl p-8">
|
| 796 |
+
<h2 class="text-3xl font-bold text-gray-900 mb-6 text-center">π Live Platform & Resources</h2>
|
| 797 |
+
|
| 798 |
+
<div class="grid md:grid-cols-2 gap-6">
|
| 799 |
+
<div class="bg-white rounded-lg p-6">
|
| 800 |
+
<h3 class="text-xl font-semibold text-gray-800 mb-4">π Production Deployment</h3>
|
| 801 |
+
<ul class="space-y-2">
|
| 802 |
+
<li>
|
| 803 |
+
<a href="https://www.copernicusai.fyi" target="_blank" rel="noopener noreferrer"
|
| 804 |
+
class="text-blue-600 hover:text-blue-800 font-medium">
|
| 805 |
+
π Homepage - Browse Podcasts (opens in new tab)
|
| 806 |
+
</a>
|
| 807 |
+
</li>
|
| 808 |
+
<li>
|
| 809 |
+
<a href="https://www.copernicusai.fyi/subscriber-dashboard.html" target="_blank" rel="noopener noreferrer"
|
| 810 |
+
class="text-blue-600 hover:text-blue-800 font-medium">
|
| 811 |
+
π Creator Dashboard (opens in new tab)
|
| 812 |
+
</a>
|
| 813 |
+
</li>
|
| 814 |
+
<li>
|
| 815 |
+
<a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/feeds/copernicus-mvp-rss-feed.xml" target="_blank" rel="noopener noreferrer"
|
| 816 |
+
class="text-blue-600 hover:text-blue-800 font-medium">
|
| 817 |
+
π‘ RSS Feed (opens in new tab)
|
| 818 |
+
</a>
|
| 819 |
+
</li>
|
| 820 |
+
</ul>
|
| 821 |
+
</div>
|
| 822 |
+
|
| 823 |
+
<div class="bg-white rounded-lg p-6">
|
| 824 |
+
<h3 class="text-xl font-semibold text-gray-800 mb-4">π§© Knowledge Engine Components</h3>
|
| 825 |
+
<p class="text-sm text-gray-600 mb-4">
|
| 826 |
+
The CopernicusAI Knowledge Engine is an integrated ecosystem of research and collaboration tools.
|
| 827 |
+
The <strong>Knowledge Engine Dashboard is now fully operational</strong> (December 2025) with a working web interface providing unified access to all components.
|
| 828 |
</p>
|
| 829 |
+
<div class="bg-green-50 rounded-lg p-4 mb-4">
|
| 830 |
+
<h4 class="font-semibold text-gray-800 mb-2">β
Knowledge Engine Dashboard (Implemented)</h4>
|
| 831 |
+
<p class="text-sm text-gray-700 mb-2">
|
| 832 |
+
Fully operational web interface with knowledge graph visualization (23,246 papers), vector search, RAG queries, and content browsing.
|
| 833 |
+
</p>
|
| 834 |
+
<a href="https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine"
|
| 835 |
+
target="_blank" rel="noopener noreferrer"
|
| 836 |
+
class="text-blue-600 hover:underline text-sm font-medium">
|
| 837 |
+
Live System: https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine β (opens in new tab)
|
| 838 |
+
</a>
|
| 839 |
+
</div>
|
| 840 |
+
<ul class="space-y-3">
|
| 841 |
+
<li>
|
| 842 |
+
<a href="https://huggingface.co/spaces/garywelz/programming_framework" target="_blank" rel="noopener noreferrer"
|
| 843 |
+
class="text-blue-600 hover:text-blue-800 font-medium">
|
| 844 |
+
π οΈ Programming Framework (opens in new tab)
|
| 845 |
+
</a>
|
| 846 |
+
<p class="text-sm text-gray-600 mt-1 ml-6">
|
| 847 |
+
Foundational meta-tool for universal process analysis across any discipline
|
| 848 |
+
</p>
|
| 849 |
+
</li>
|
| 850 |
+
<li>
|
| 851 |
+
<a href="https://huggingface.co/spaces/garywelz/glmp" target="_blank" rel="noopener noreferrer"
|
| 852 |
+
class="text-blue-600 hover:text-blue-800 font-medium">
|
| 853 |
+
𧬠GLMP - Genome Logic Modeling Project (opens in new tab)
|
| 854 |
+
</a>
|
| 855 |
+
<p class="text-sm text-gray-600 mt-1 ml-6">
|
| 856 |
+
First application of Programming Framework to biology - 50+ biological processes visualized
|
| 857 |
+
</p>
|
| 858 |
+
</li>
|
| 859 |
+
<li>
|
| 860 |
+
<a href="https://huggingface.co/spaces/garywelz/metadata_database" target="_blank" rel="noopener noreferrer"
|
| 861 |
+
class="text-blue-600 hover:text-blue-800 font-medium">
|
| 862 |
+
π Research Paper Metadata Database (opens in new tab)
|
| 863 |
+
</a>
|
| 864 |
+
<p class="text-sm text-gray-600 mt-1 ml-6">
|
| 865 |
+
Core data infrastructure for structured research paper metadata and citation networks
|
| 866 |
+
</p>
|
| 867 |
+
</li>
|
| 868 |
+
<li>
|
| 869 |
+
<a href="https://huggingface.co/spaces/garywelz/sciencevideodb" target="_blank" rel="noopener noreferrer"
|
| 870 |
+
class="text-blue-600 hover:text-blue-800 font-medium">
|
| 871 |
+
π¬ Science Video Database (opens in new tab)
|
| 872 |
+
</a>
|
| 873 |
+
<p class="text-sm text-gray-600 mt-1 ml-6">
|
| 874 |
+
Multi-modal content component with transcript-based search for scientific videos
|
| 875 |
+
</p>
|
| 876 |
+
</li>
|
| 877 |
+
</ul>
|
| 878 |
</div>
|
| 879 |
</div>
|
| 880 |
+
</div>
|
| 881 |
+
</section>
|
| 882 |
+
|
| 883 |
+
<!-- API Endpoints -->
|
| 884 |
+
<section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
|
| 885 |
+
<div class="bg-gray-900 text-white rounded-xl p-8">
|
| 886 |
+
<h2 class="text-3xl font-bold mb-6">π API Documentation</h2>
|
| 887 |
+
<p class="text-gray-300 mb-6">Base URL: <code class="bg-gray-800 px-2 py-1 rounded">https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app</code></p>
|
| 888 |
+
|
| 889 |
+
<div class="grid md:grid-cols-3 gap-4 text-sm mb-8">
|
| 890 |
+
<div>
|
| 891 |
+
<h4 class="font-semibold text-blue-300 mb-2">Podcast Generation</h4>
|
| 892 |
+
<ul class="space-y-1 text-gray-400">
|
| 893 |
+
<li>POST /generate-podcast-with-subscriber</li>
|
| 894 |
+
<li>GET /api/subscribers/podcasts/{id}</li>
|
| 895 |
+
<li>POST /api/subscribers/podcasts/submit-to-rss</li>
|
| 896 |
+
</ul>
|
| 897 |
+
</div>
|
| 898 |
+
|
| 899 |
+
<div>
|
| 900 |
+
<h4 class="font-semibold text-blue-300 mb-2">Research Endpoints</h4>
|
| 901 |
+
<ul class="space-y-1 text-gray-400">
|
| 902 |
+
<li>POST /api/papers/upload</li>
|
| 903 |
+
<li>GET /api/papers/{paper_id}</li>
|
| 904 |
+
<li>POST /api/papers/query</li>
|
| 905 |
+
<li>POST /api/papers/{id}/link-podcast/{id}</li>
|
| 906 |
+
</ul>
|
| 907 |
+
</div>
|
| 908 |
+
|
| 909 |
+
<div>
|
| 910 |
+
<h4 class="font-semibold text-blue-300 mb-2">Admin Endpoints</h4>
|
| 911 |
+
<ul class="space-y-1 text-gray-400">
|
| 912 |
+
<li>GET /api/admin/subscribers</li>
|
| 913 |
+
<li>POST /api/admin/podcasts/fix-missing-titles</li>
|
| 914 |
+
<li>GET /api/admin/podcasts/catalog</li>
|
| 915 |
+
</ul>
|
| 916 |
+
</div>
|
| 917 |
+
</div>
|
| 918 |
+
|
| 919 |
+
<div class="border-t border-gray-700 pt-6 mt-6">
|
| 920 |
+
<h3 class="text-xl font-semibold text-blue-300 mb-4">π Example Request</h3>
|
| 921 |
+
<div class="bg-gray-800 rounded-lg p-4 mb-4">
|
| 922 |
+
<p class="text-gray-400 text-xs mb-2">POST /api/papers/query</p>
|
| 923 |
+
<pre class="text-green-400 text-xs overflow-x-auto"><code>{
|
| 924 |
+
"discipline": "biology",
|
| 925 |
+
"keywords": ["DNA replication", "cell cycle"],
|
| 926 |
+
"date_range": {
|
| 927 |
+
"start": "2020-01-01",
|
| 928 |
+
"end": "2025-01-01"
|
| 929 |
+
},
|
| 930 |
+
"limit": 10
|
| 931 |
+
}</code></pre>
|
| 932 |
+
</div>
|
| 933 |
+
|
| 934 |
+
<h3 class="text-xl font-semibold text-blue-300 mb-4 mt-6">π€ Example Response</h3>
|
| 935 |
+
<div class="bg-gray-800 rounded-lg p-4 mb-4">
|
| 936 |
+
<pre class="text-green-400 text-xs overflow-x-auto"><code>{
|
| 937 |
+
"status": "success",
|
| 938 |
+
"count": 10,
|
| 939 |
+
"papers": [
|
| 940 |
+
{
|
| 941 |
+
"id": "pmid_12345678",
|
| 942 |
+
"title": "Mechanisms of DNA Replication...",
|
| 943 |
+
"authors": ["Smith, J.", "Doe, A."],
|
| 944 |
+
"journal": "Nature",
|
| 945 |
+
"year": 2023,
|
| 946 |
+
"doi": "10.1038/s41586-023-01234",
|
| 947 |
+
"abstract": "..."
|
| 948 |
+
}
|
| 949 |
+
]
|
| 950 |
+
}</code></pre>
|
| 951 |
+
</div>
|
| 952 |
+
|
| 953 |
+
<div class="bg-gray-800 rounded-lg p-4 mt-4">
|
| 954 |
+
<h4 class="font-semibold text-blue-300 mb-2 text-sm">π Authentication</h4>
|
| 955 |
+
<p class="text-gray-400 text-xs mb-2">API uses Bearer token authentication. Include in request headers:</p>
|
| 956 |
+
<pre class="text-green-400 text-xs"><code>Authorization: Bearer YOUR_API_TOKEN</code></pre>
|
| 957 |
+
</div>
|
| 958 |
+
|
| 959 |
+
<div class="bg-gray-800 rounded-lg p-4 mt-4">
|
| 960 |
+
<h4 class="font-semibold text-blue-300 mb-2 text-sm">β‘ Rate Limits</h4>
|
| 961 |
+
<p class="text-gray-400 text-xs">Standard rate limits apply: 100 requests/minute per API key. Contact for higher limits.</p>
|
| 962 |
+
</div>
|
| 963 |
+
|
| 964 |
+
<div class="bg-gray-800 rounded-lg p-4 mt-4">
|
| 965 |
+
<h4 class="font-semibold text-blue-300 mb-2 text-sm">π API Version</h4>
|
| 966 |
+
<p class="text-gray-400 text-xs">Current version: v1.0. API is stable and backward-compatible.</p>
|
| 967 |
+
</div>
|
| 968 |
</div>
|
| 969 |
</div>
|
| 970 |
</section>
|
|
|
|
| 972 |
<!-- Footer -->
|
| 973 |
<footer class="gradient-bg text-white py-8 mt-12">
|
| 974 |
<div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 text-center">
|
| 975 |
+
<p class="text-lg font-semibold mb-2">CopernicusAI - Advancing Scientific Knowledge</p>
|
| 976 |
+
<p class="text-sm opacity-75">Built with Google Cloud, Gemini AI, OpenAI, Anthropic Claude, and ElevenLabs</p>
|
| 977 |
+
<p class="text-xs opacity-50 mt-4">© 2025 CopernicusAI. All rights reserved.</p>
|
| 978 |
</div>
|
| 979 |
</footer>
|
|
|
|
|
|
|
|
|
|
|
|
|
| 980 |
</body>
|
| 981 |
</html>
|
|
|