garywelz commited on
Commit
b90bc7e
Β·
verified Β·
1 Parent(s): f9a72c8

Upload 2 files

Browse files
Files changed (2) hide show
  1. README.md +501 -117
  2. index.html +782 -371
README.md CHANGED
@@ -1,179 +1,563 @@
1
  ---
2
- title: The Programming Framework
3
- emoji: πŸ› οΈ
4
- colorFrom: yellow
5
- colorTo: red
6
  sdk: static
7
- pinned: true
8
  license: mit
9
  ---
10
 
11
- # πŸ› οΈ The Programming Framework
12
 
13
- A Universal Method for Process Analysis
14
 
15
  ## Summary
16
 
17
- The **Programming Framework** is a universal meta-tool for analyzing complex processes across any discipline by combining Large Language Models (LLMs) with visual flowchart representation. The Framework transforms textual process descriptions into structured, interactive Mermaid flowcharts stored as JSON, enabling systematic analysis, visualization, and integration with knowledge systems.
18
 
19
- Successfully demonstrated through GLMP (Genome Logic Modeling Project) with 50+ biological processes, and applied across Chemistry, Mathematics, Physics, and Computer Science. The Framework serves as the foundational methodology for the CopernicusAI Knowledge Engine, enabling domain-specific process visualization and analysis.
20
 
21
- ## πŸ“š Prior Work & Research Contributions
22
 
23
- ### Overview
24
- The Programming Framework represents **prior work** that demonstrates a novel methodology for analyzing complex processes by combining Large Language Models (LLMs) with visual flowchart representation. This research establishes a universal, domain-agnostic approach to process analysis that transforms textual descriptions into structured, interactive visualizations.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
25
 
26
- ### πŸ”¬ Research Contributions
27
- - **Universal Process Analysis:** Domain-agnostic methodology applicable across biology, chemistry, software engineering, business processes, and more
28
- - **LLM-Powered Extraction:** Automated extraction of process steps, decision points, and logic flows using Google Gemini 2.0 Flash
29
- - **Structured Visualization:** Mermaid.js-based flowchart generation encoded as JSON for programmatic access and integration
30
- - **Iterative Refinement:** Systematic approach enabling continuous improvement through visualization and LLM-assisted refinement
31
 
32
- ### βš™οΈ Technical Achievements
33
- - **Meta-Tool Architecture:** Framework for creating specialized process analysis tools (demonstrated by GLMP)
34
- - **JSON-Based Storage:** Structured data format enabling version control, cross-referencing, and API integration
35
- - **Multi-Domain Application:** Successfully applied to biological processes (GLMP), with extensions planned for software, business, and engineering domains
36
- - **Integration Framework:** Designed for integration with knowledge engines, research databases, and collaborative platforms
 
 
37
 
38
- ### 🎯 Position Within CopernicusAI Knowledge Engine
39
- The Programming Framework serves as the **foundational meta-tool** of the CopernicusAI Knowledge Engine, providing the underlying methodology that enables specialized applications:
40
 
41
- - **GLMP (Genome Logic Modeling Project)** - First specialized application demonstrating biological process visualization
42
- - **Copernicus AI** - Main knowledge engine integrating Framework outputs with AI podcasts and research synthesis
43
- - **Knowledge Engine Dashboard** (βœ… Implemented December 2025) - Fully operational web interface with knowledge graph visualization, vector search, RAG queries, and content browsing. Processes from Chemistry, Physics, Mathematics, and Computer Science are accessible through the unified dashboard. Live at: https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine
44
- - **Research Papers Metadata Database** - Integration for linking processes to source literature (12,000+ papers indexed)
45
- - **Science Video Database** - Potential integration for multi-modal process explanations
 
46
 
47
- This work establishes a proof-of-concept for AI-assisted process analysis, demonstrating how LLMs can systematically extract and visualize complex logic from textual sources across diverse domains. The Knowledge Engine now provides a unified interface for exploring processes alongside research papers, podcasts, and other content types.
48
 
49
- ## 🎯 Overview
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
50
 
51
- The Programming Framework is a **meta-tool**β€”a tool for creating tools. It provides a systematic method for analyzing any complex process by combining the analytical power of Large Language Models with the clarity of visual flowcharts.
52
 
53
- ## πŸ’‘ The Core Idea
54
 
55
- **Problem:** Complex processes are difficult to understand because they involve many steps, decision points, and interactions. Traditional text descriptions are hard to follow.
 
 
 
 
56
 
57
- **Solution:** Use LLMs to extract process logic from literature, then encode it as Mermaid flowcharts stored in JSON. Result: Clear, interactive visualizations that reveal hidden patterns and enable systematic analysis.
 
 
 
58
 
59
- ## βš™οΈ How It Works
60
 
61
- 1. **Input Process** - Provide scientific papers, documentation, or process descriptions
62
- 2. **LLM Analysis** - AI extracts steps, decisions, branches, and logic flow
63
- 3. **Generate Flowchart** - Create Mermaid diagram encoded as JSON structure
64
- 4. **Visualize & Iterate** - Interactive flowchart reveals insights and enables refinement
65
 
66
- ## 🌍 Core Principles
 
 
 
67
 
68
- ### Domain Agnostic
69
- Works across any field: biology, chemistry, software engineering, business processes, legal workflows, manufacturing, and beyond.
70
 
71
- ### Iterative Refinement
72
- Start with rough analysis, visualize, identify gaps, refine with LLM, repeat until the process logic is crystal clear.
73
 
74
- ### Structured Data
75
- JSON storage enables programmatic access, version control, cross-referencing, and integration with other tools and databases.
76
 
77
- ## πŸš€ Applications
 
 
 
 
 
 
 
78
 
79
- ### 🧬 GLMP - Genome Logic Modeling (Live)
80
- First specialized application: visualizing biochemical processes like DNA replication, metabolic pathways, and cell signaling.
81
- - [Explore GLMP β†’](https://huggingface.co/spaces/garywelz/glmp)
82
 
83
- ## πŸ“š Process Diagram Collections
 
 
 
 
84
 
85
- The Programming Framework has been applied across multiple scientific disciplines. Explore interactive flowchart collections organized by domain:
 
 
 
 
86
 
87
- ### 🧬 Biology
88
- - [Biology Processes Database](https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/biology-processes-database/biology-database-table.html) - Interactive database with 52 higher-level organismal processes across 8 categories (reproduction, development, behavior, defense, nutrition, sensory, transport, coordination)
89
- - [GLMP Database Table](https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/glmp-database-table.html) - Genome Logic Modeling Project: Biochemical/molecular processes database (50+ processes)
90
- - **Note:** Biology Processes Database focuses on organismal, developmental, behavioral, and ecological processes. GLMP focuses on molecular-level biochemical processes. Together they provide comprehensive biological process coverage.
 
91
 
92
- ### βš—οΈ Chemistry
93
- - [Chemistry Database Table](https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/chemistry-processes-database/chemistry-database-table.html) - Interactive database with 56 processes across 14 subcategories
 
 
 
 
94
 
95
- ### πŸ”’ Mathematics
96
- - [Mathematics Database Table](https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/mathematics-processes-database/mathematics-database-table.html) - Interactive database with 20 processes across 7 subcategories
 
 
 
 
97
 
98
- ### βš›οΈ Physics
99
- - [Physics Database Table](https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/physics-processes-database/physics-database-table.html) - Interactive database with 21 processes across 7 subcategories
100
 
101
- ### πŸ’» Computer Science
102
- - [Computer Science Database Table](https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/computer-science-processes-database/computer-science-database-table.html) - Interactive database with 21 processes across 7 subcategories
 
103
 
104
- ## πŸ”§ Technical Architecture
105
 
106
- ### LLM Integration
107
- - Google Gemini 2.0 Flash for analysis
108
- - Vertex AI for enterprise deployment
109
- - Custom prompts for process extraction
110
- - Structured JSON output formatting
111
 
112
- ### Visualization Stack
113
- - Mermaid.js for flowchart rendering
114
- - JSON schema for data validation
115
- - Interactive SVG output
116
- - Export to PNG/PDF supported
 
 
 
 
 
 
 
 
 
 
 
 
117
 
118
- ### Data Storage
119
- - Google Cloud Storage for JSON files
120
- - Firestore for metadata indexing
121
- - Version control with Git
122
- - Cross-referencing with papers database
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
123
 
124
- ### Integration Points
125
- - GLMP specialized collections
126
- - Copernicus AI knowledge graph
127
- - Research papers database
128
- - API endpoints for programmatic access
129
 
130
- ### How to Cite This Work
131
 
132
- Welz, G. (2024–2025). *The Programming Framework: A Universal Method for Process Analysis*.
133
- Hugging Face Spaces. https://huggingface.co/spaces/garywelz/programming_framework
134
 
135
- Welz, G. (2024). *From Inspiration to AI: Biology as Visual Programming*. Medium.
136
- https://medium.com/@garywelz_47126/from-inspiration-to-ai-biology-as-visual-programming-520ee523029a
 
 
137
 
138
- This project serves as a foundational meta-tool for AI-assisted process analysis, enabling systematic extraction and visualization of complex logic from textual sources across diverse scientific and technical domains.
 
 
 
 
 
 
139
 
140
- The Programming Framework is designed as infrastructure for AI-assisted science, providing a universal methodology that can be specialized for domain-specific applications.
141
 
142
- ## πŸ”— Related Projects
143
 
144
- ### 🧬 GLMP - Genome Logic Modeling
145
- First specialized application of the Programming Framework to biochemical processes. 100+ biological pathways visualized.
146
- - [Visit GLMP β†’](https://huggingface.co/spaces/garywelz/glmp)
147
 
148
- ### πŸ”¬ Copernicus AI
149
- Knowledge engine integrating the Programming Framework with AI podcasts, research papers, and knowledge graph for scientific discovery.
150
- - [Visit Copernicus AI β†’](https://huggingface.co/spaces/garywelz/copernicusai)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
151
 
152
- ## 🎨 Interactive Demo
153
 
154
- The space includes interactive examples showing the framework applied to:
155
- - Scientific Method
156
- - Software Deployment Pipeline
157
- - Customer Support Workflow
158
- - Research Paper Publication
159
 
160
- Each example demonstrates how LLMs extract process logic and encode it as visual flowcharts.
 
161
 
162
- ## πŸ’» Technology Stack
163
 
164
- - **LLM**: Google Gemini 2.0 Flash, Vertex AI
165
- - **Visualization**: Mermaid.js
166
- - **Storage**: Google Cloud Storage, Firestore
167
- - **Format**: JSON with Mermaid syntax
168
- - **Frontend**: Static HTML + Tailwind CSS
169
 
170
- ## 🌟 Vision
171
 
172
- As AI systems become more capable of understanding complex processes, the Programming Framework provides the bridge between human comprehension and machine analysis. It's a tool for truth-seekingβ€”transforming complexity into clarity.
 
 
 
173
 
174
  ---
175
 
176
- **A Universal Method for Process Analysis**
 
 
 
 
 
 
177
 
178
- Β© 2025 Gary Welz. All rights reserved.
179
 
 
 
1
  ---
2
+ title: CopernicusAI - Research-Driven Podcast Generation Platform
3
+ emoji: πŸ”¬
4
+ colorFrom: purple
5
+ colorTo: blue
6
  sdk: static
7
+ pinned: false
8
  license: mit
9
  ---
10
 
11
+ # πŸ”¬ CopernicusAI - Knowledge Engine for Scientific Discovery
12
 
13
+ A collaborative research platform that transforms cutting-edge scientific research into accessible, multi-format tools for collective knowledge exploration. These are research instrumentsβ€”like microscopes for observing the collective knowledge of humanityβ€”enabling hypothesis formation, testing, and discovery across scientific disciplines.
14
 
15
  ## Summary
16
 
17
+ **CopernicusAI** is an operational research platform that synthesizes scientific literature from 250+ million papers into AI-generated podcasts, integrates with a knowledge graph of 12,000+ indexed papers, and provides collaborative tools for research discovery. The system demonstrates production-ready multi-source research synthesis with full citation tracking and evidence-based content generation requiring minimum 3 research sources per episode.
18
 
19
+ The platform includes a fully operational Knowledge Engine Dashboard (deployed December 2025) with interactive knowledge graph visualization, vector search, and RAG capabilities, enabling researchers to explore, query, and synthesize scientific knowledge across disciplines.
20
 
21
+ ## Prior Work: CopernicusAI Research Interface
22
 
23
+ CopernicusAI is an active research prototype exploring AI-generated audio briefings as an interface for assisted scientific research.
24
+
25
+ The system allows any user to generate, refine, and share AI-generated science podcasts based on structured prompts, enabling rapid orientation to a topic, iterative deepening, and personalized research briefings.
26
+
27
+ Rather than functioning as a static content platform, CopernicusAI supports collectively generated and shared research artifacts, analogous to community-driven knowledge platforms (e.g., discussion forums), but grounded in scientific sources and metadata-aware workflows.
28
+
29
+ This work demonstrates technical feasibility for:
30
+ - AI-assisted research briefing and orientation
31
+ - Iterative question refinement via conversational interfaces
32
+ - Integration of text, audio, and metadata in research workflows
33
+
34
+ ### Current Implementation (December 2025)
35
+
36
+ The Knowledge Engine Dashboard is **fully operational** and deployed to Google Cloud Run, providing unified access to all components with interactive knowledge graph visualization, vector search, RAG queries, and content browsing.
37
+
38
+ ## 🎯 Mission & Vision
39
+
40
+ Inspired by Nicolaus Copernicus who challenged accepted knowledge with evidence and rigorous analysis, **CopernicusAI** creates collaborative research tools that enable collective participation in scientific discovery. These platforms are instruments for exploring humanity's collective knowledgeβ€”tools for hypothesis formation, testing, and collaborative research, not just educational content.
41
+
42
+ Just as a microscope enables observation of the microscopic world, CopernicusAI tools enable observation and exploration of humanity's collective knowledge. Subscribers collaborate to prompt, generate, and refine research contentβ€”sharing discoveries publicly or keeping them private. As large language models (LLMs) and AI systems gain unprecedented knowledge, CopernicusAI provides the infrastructure for human-AI collaborative knowledge exploration, with evidence-based truth-seeking as our guiding principle.
43
+
44
+ ---
45
+
46
+ ## 🌟 Core Platform Capabilities
47
+
48
+ ### πŸŽ™οΈ AI-Powered Podcast Generation
49
+
50
+ **Production-Ready System:**
51
+ - Collaborative platform where subscribers prompt and generate multi-voice AI podcasts (5-10 minutes) synthesizing research from multiple academic sources
52
+ - Subscribers can share their podcasts publicly or keep them private
53
+ - Evidence-based content generation requiring minimum 3 research sources per episode
54
+ - Comprehensive research integration across 8+ academic databases
55
+ - **64 episodes** generated across Biology, Chemistry, Computer Science, Mathematics, and Physics
56
+ - Automated audio synthesis with professional multi-speaker dialogue
57
+ - AI-generated episode thumbnails with scientific visualizations
58
+ - RSS feed distribution compatible with Spotify, Apple Podcasts, Google Podcasts
59
+
60
+ **Research Integration:**
61
+ - Real-time discovery from PubMed, arXiv, NASA ADS, Zenodo, bioRxiv, CORE, Google Scholar, and News APIs
62
+ - Parallel search across multiple databases for comprehensive coverage
63
+ - Quality scoring and relevance ranking of research sources
64
+ - Paradigm shift identification and interdisciplinary connection analysis
65
+ - Automatic citation extraction and formatting
66
+ - Source validation and authenticity verification
67
+
68
+ ### πŸ€– Advanced LLM Integration
69
+
70
+ **Multi-Model Architecture:**
71
+ - **Google Gemini 3** - Latest research analysis and content generation
72
+ - **OpenAI GPT-4/GPT-3.5** - Content synthesis and quality validation
73
+ - **Anthropic Claude 3** (Sonnet, Haiku via OpenRouter) - Alternative reasoning paths
74
+ - **ElevenLabs TTS** - Multi-voice text-to-speech synthesis
75
+ - Model selection based on task complexity and expertise level
76
+ - Fallback chains for reliability and cost optimization
77
+
78
+ **Capabilities:**
79
+ - Multi-paper analysis and synthesis
80
+ - Paradigm shift detection in research domains
81
+ - Interdisciplinary connection identification
82
+ - Entity extraction (genes, proteins, chemical compounds, mathematical concepts)
83
+ - Citation tracking and cross-reference analysis
84
+ - Content quality scoring and validation
85
+
86
+ ### πŸ“Š Research Resource Access
87
+
88
+ **Comprehensive Academic Database Coverage:**
89
+
90
+ Our research pipeline integrates with **8+ major academic databases**, providing access to:
91
+
92
+ - **PubMed/NCBI** (~30+ million biomedical papers)
93
+ - **arXiv** (~2+ million preprints in physics, mathematics, CS, quantitative biology)
94
+ - **NASA ADS** (~15+ million astronomy/astrophysics papers)
95
+ - **Zenodo** (100K+ open science datasets and publications)
96
+ - **bioRxiv/medRxiv** (preprints in life sciences)
97
+ - **CORE** (~200+ million open access papers)
98
+ - **Google Scholar** (comprehensive academic search)
99
+ - **News API** (current events and trending research topics)
100
+ - **YouTube Data API** (academic videos, conference talks, lectures)
101
+
102
+ **Total Access:** **250+ million research papers and academic resources** across all major scientific disciplines.
103
+
104
+ ### πŸŽ™οΈ Audio and Video Podcast Production
105
+
106
+ **Operating Audio Podcast System:**
107
+ Full production and distribution platform for subscriber-generated podcasts. Users can prompt, generate, publish, and distribute audio podcasts with RSS feed support for Spotify, Apple Podcasts, and Google Podcasts.
108
+
109
+ - Multi-voice AI podcast generation
110
+ - Research-driven content creation
111
+ - RSS feed distribution
112
+ - Public and private podcast options
113
+ - Professional audio quality
114
+
115
+ **Video Production (Future - Phase 2+):**
116
+
117
+ Advanced video features planned for future development:
118
+
119
+ **Planned Advanced Features (Phase 2-4):**
120
+ - **Visual Content Integration:**
121
+ - Automated extraction of figures and diagrams from research papers
122
+ - Screen capture and processing of academic illustrations
123
+ - Web scraping from scientific journal websites and preprint servers
124
+ - JSON database integration for structured visual data
125
+
126
+ - **Dynamic Visualization Generation:**
127
+ - On-the-fly scientific animations (molecular structures, data flows, algorithms)
128
+ - Real-time chart and graph generation from research data
129
+ - Python-based animations using matplotlib, plotly, mayavi
130
+ - Mathematical formula rendering (LaTeX β†’ video)
131
+
132
+ - **External Video Quoting:**
133
+ - YouTube video segment extraction and integration
134
+ - Time-stamped video quoting with proper attribution
135
+ - Educational fair use compliance
136
+ - Source video discovery during research phase
137
+
138
+ - **Advanced Composition:**
139
+ - Multi-layer video composition (background, content, overlays, effects)
140
+ - Automatic subtitle generation from transcripts
141
+ - Text overlay system (key concepts, citations, speaker identification)
142
+ - Professional transitions and effects
143
+ - Audio-visual synchronization
144
+
145
+ **See:** [Science Video Database](https://huggingface.co/spaces/garywelz/sciencevideodb) - Companion project for research video content management.
146
+
147
+ ### πŸ“š Research Papers Metadata Database (Phase 2)
148
+
149
+ **Planned Implementation:**
150
+ A centralized **metadata repository** (not a file archive) that provides:
151
+
152
+ - **Structured JSON Objects:** Research paper metadata including:
153
+ - DOI, arXiv ID, publication information
154
+ - Abstracts and key findings
155
+ - Extracted entities (genes, proteins, chemical compounds, equations)
156
+ - Citation networks and cross-references
157
+ - Paradigm shift indicators
158
+ - Interdisciplinary connections
159
+ - Quality scores and relevance metrics
160
+
161
+ - **AI-Powered Preprocessing:**
162
+ - LLM-based entity extraction and annotation
163
+ - Automatic categorization by discipline and subdomain
164
+ - Keyword extraction and semantic tagging
165
+ - Citation tracking and relationship mapping
166
+ - Quality assessment and validation
167
+
168
+ - **Integration Features:**
169
+ - DOI/arXiv ID resolution and metadata enrichment
170
+ - Cross-reference linking between papers
171
+ - Podcast-to-paper relationship tracking
172
+ - Search and query capabilities
173
+ - API access for programmatic retrieval
174
+
175
+ **Technical Architecture:**
176
+ - Firestore NoSQL database for flexible JSON storage
177
+ - Google Cloud Functions for automated metadata processing
178
+ - Vertex AI for entity extraction and analysis
179
+ - RESTful API for external access
180
+
181
+ **Benefits:**
182
+ - Enables rapid research discovery across podcasts
183
+ - Supports knowledge graph construction
184
+ - Facilitates cross-disciplinary pattern recognition
185
+ - Provides foundation for semantic search capabilities
186
+
187
+ ---
188
+
189
+ ## πŸ—„οΈ System Architecture
190
+
191
+ ### Database Structure (Firestore)
192
+
193
+ **Collections:**
194
+ - **`subscribers`** - User accounts, preferences, subscription tiers, usage analytics
195
+ - **`podcast_jobs`** - Generated podcasts with full metadata, source papers, engagement metrics
196
+ - **`episodes`** - Published episodes with RSS distribution status
197
+ - **`research_papers`** (Phase 2) - Paper metadata database with AI-extracted entities
198
 
199
+ ### Storage Structure (Google Cloud Storage)
 
 
 
 
200
 
201
+ - **`audio/`** - MP3 podcast files (multi-voice ElevenLabs synthesis)
202
+ - **`videos/`** - MP4 video podcasts (current and future)
203
+ - **`transcripts/`** - Full text transcripts with speaker markers
204
+ - **`descriptions/`** - Markdown descriptions with academic references
205
+ - **`thumbnails/`** - AI-generated episode artwork (DALL-E 3)
206
+ - **`video-assets/`** - Extracted figures, animations, visual content
207
+ - **`glmp-v2/`** - Genome Logic Modeling Project flowcharts (JSON)
208
 
209
+ ### Backend Services (Google Cloud Run)
 
210
 
211
+ **Microservices Architecture:**
212
+ - **Podcast Generation Service** - Orchestrates research, content generation, and media production
213
+ - **Research Pipeline Service** - Multi-API academic search and analysis
214
+ - **Video Generation Service** - Video composition and encoding (Phase 1 complete)
215
+ - **RSS Service** - Feed generation and distribution
216
+ - **Episode Service** - Catalog management and metadata
217
 
218
+ ---
219
 
220
+ ## βš™οΈ Technology Stack
221
+
222
+ ### AI & Machine Learning
223
+ - **Google Gemini 3** - Latest LLM for research analysis
224
+ - **Google Vertex AI** - Enterprise-scale model deployment and orchestration (used throughout platform)
225
+ - **OpenAI GPT-4/GPT-3.5** - Content synthesis and validation
226
+ - **Anthropic Claude 3** - Alternative reasoning via OpenRouter
227
+ - **ElevenLabs TTS** - Multi-voice text-to-speech synthesis
228
+ - **DALL-E 3** - AI-generated scientific visualizations
229
+ - **Google Cloud Vision API** - Image analysis and quality assessment
230
+ - **Video Intelligence API** - Scene detection and content analysis
231
+
232
+ ### Backend Infrastructure
233
+ - **FastAPI** (Python) - RESTful API framework
234
+ - **Google Cloud Run** - Serverless container deployment
235
+ - **Firestore** - NoSQL document database
236
+ - **Cloud Storage** - Media file storage and CDN
237
+ - **Cloud Functions** - Event-driven processing
238
+ - **Cloud Tasks** - Background job queuing
239
+ - **Secret Manager** - API key and credential management
240
+
241
+ ### Media Processing
242
+ - **FFmpeg** - Video encoding and composition
243
+ - **MoviePy** - Python video editing (planned)
244
+ - **Matplotlib/Plotly** - Scientific visualization (planned)
245
+ - **PyPDF2/pdfplumber** - PDF processing (planned)
246
+
247
+ ### Frontend
248
+ - **Next.js 15.5.7** - React framework
249
+ - **Alpine.js** - Lightweight reactive UI
250
+ - **Tailwind CSS** - Utility-first styling
251
+ - **Vercel** - Frontend hosting and deployment
252
 
253
+ ---
254
 
255
+ ## πŸ“ˆ Platform Capabilities
256
 
257
+ ### Research Coverage
258
+ - **250+ million research papers** accessible through integrated APIs
259
+ - **8+ academic databases** integrated with parallel search
260
+ - **Minimum 3 sources** required per episode for quality assurance
261
+ - **Multi-paper analysis** for comprehensive coverage
262
 
263
+ ### Platform Features
264
+ - **Subscriber-driven content generation** - Users prompt and create podcasts
265
+ - **RSS feed distribution** to major podcast platforms
266
+ - **Public and private podcast options** - Share discoveries or keep them private
267
 
268
+ ---
269
 
270
+ ## πŸ”— Live Platform & Resources
 
 
 
271
 
272
+ ### Production Deployment
273
+ - 🏠 **[Homepage - Browse Podcasts](https://www.copernicusai.fyi)** - Public podcast catalog
274
+ - πŸ“Š **[Creator Dashboard](https://www.copernicusai.fyi/subscriber-dashboard.html)** - Subscriber interface
275
+ - πŸ“‘ **[RSS Feed](https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/feeds/copernicus-mvp-rss-feed.xml)** - Podcast distribution feed
276
 
277
+ ## 🧩 CopernicusAI Knowledge Engine Components
 
278
 
279
+ The CopernicusAI Knowledge Engine is an integrated ecosystem of research and collaboration tools. The Knowledge Engine is **fully implemented and operational** (December 2025), with a working system deployed to Google Cloud Run. Currently, the platform includes five core components, with additional tools, databases, and collaboration features planned for future development:
 
280
 
281
+ ### 🎯 Knowledge Engine Implementation (December 2025)
 
282
 
283
+ **Fully Operational System:**
284
+ - **Live Dashboard:** https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine
285
+ - **Knowledge Graph:** Interactive visualization with 12,000+ indexed mathematics papers, relationship extraction (citations, semantic similarity, categories), and graph query capabilities
286
+ - **Vector Search:** Semantic search using Vertex AI embeddings across papers, podcasts, and processes
287
+ - **RAG System:** Retrieval-augmented generation with citation support, context retrieval, and multi-modal content integration
288
+ - **Unified Web Dashboard:** Production-ready interface with knowledge map visualization, search, RAG queries, content browsing, and statistics
289
+ - **Architecture:** FastAPI backend, Next.js frontend, Firestore database, Vertex AI for embeddings and LLM capabilities, Model Context Protocol (MCP) server for AI assistant integration
290
+ - **Deployment:** Fully deployed to Google Cloud Run, accessible 24/7
291
 
292
+ ### Core Components
 
 
293
 
294
+ 1. **πŸ”¬ CopernicusAI (This Platform)** - Core synthesis and distribution component
295
+ - AI-powered research synthesis and podcast generation
296
+ - Multi-API research integration (250+ million papers)
297
+ - Subscriber-driven content creation and sharing
298
+ - RSS feed distribution and platform management
299
 
300
+ 2. **πŸ› οΈ Programming Framework** - Foundational meta-tool
301
+ - Universal method for process analysis across any discipline
302
+ - LLM-powered extraction and Mermaid visualization
303
+ - Domain-agnostic methodology for complex process analysis
304
+ - [Explore Framework β†’](https://huggingface.co/spaces/garywelz/programming_framework)
305
 
306
+ 3. **🧬 GLMP - Genome Logic Modeling Project** - Specialized biological application
307
+ - First application of Programming Framework to biology
308
+ - 50+ biological processes visualized as interactive flowcharts
309
+ - JSON-based structured data in Google Cloud Storage
310
+ - [Explore GLMP β†’](https://huggingface.co/spaces/garywelz/glmp)
311
 
312
+ 4. **πŸ“š Research Paper Metadata Database** - Core data infrastructure
313
+ - Centralized metadata repository for scientific research papers
314
+ - AI-powered preprocessing and entity extraction
315
+ - Citation network analysis and relationship mapping
316
+ - Foundation for knowledge graph construction
317
+ - [Explore Metadata Database β†’](https://huggingface.co/spaces/garywelz/metadata_database)
318
 
319
+ 5. **🎬 Science Video Database** - Multi-modal content component
320
+ - Curated searchable database of scientific video content
321
+ - Transcript-based search across multiple disciplines
322
+ - Integration with YouTube and other video sources
323
+ - [Explore Video Database β†’](https://huggingface.co/spaces/garywelz/sciencevideodb)
324
+ - [Live Demo β†’](https://scienceviddb-web-204731194849.us-central1.run.app/)
325
 
326
+ ### Future Components
 
327
 
328
+ The Knowledge Engine is designed to grow and evolve. Additional tools, databases, and collaboration components will be added as the project develops, expanding capabilities for AI-assisted scientific research and knowledge discovery.
329
+
330
+ ---
331
 
332
+ ## πŸ”Œ API Documentation
333
 
334
+ **Base URL:** `https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app`
 
 
 
 
335
 
336
+ ### Podcast Generation Endpoints
337
+ - `POST /generate-podcast-with-subscriber` - Generate new podcast from research topic
338
+ - `GET /api/subscribers/podcasts/{id}` - Retrieve podcast details
339
+ - `POST /api/subscribers/podcasts/submit-to-rss` - Publish to RSS feed
340
+
341
+ ### Research Endpoints
342
+ - `POST /api/papers/upload` - Upload paper metadata (Phase 2)
343
+ - `GET /api/papers/{paper_id}` - Retrieve paper metadata
344
+ - `POST /api/papers/query` - Query papers by discipline, keywords
345
+ - `POST /api/papers/{id}/link-podcast/{id}` - Link paper to podcast
346
+
347
+ ### Admin Endpoints
348
+ - `GET /api/admin/subscribers` - List all subscribers and statistics
349
+ - `POST /api/admin/podcasts/fix-missing-titles` - Content maintenance
350
+ - `GET /api/admin/podcasts/catalog` - Full podcast catalog
351
+
352
+ ---
353
 
354
+ ## πŸš€ Development Roadmap
355
+
356
+ ### βœ… Phase 1: Core Platform (Complete)
357
+ - Multi-API research integration
358
+ - AI podcast generation with multi-voice synthesis
359
+ - RSS feed distribution
360
+ - Subscriber platform
361
+ - Basic video generation (static)
362
+
363
+ ### πŸ”„ Phase 2: Content Enhancement (In Progress)
364
+ - **Research Papers Metadata Database** - JSON-based metadata repository
365
+ - **Visual Content Extraction** - Figures from papers, web scraping
366
+ - **YouTube Video Quoting** - External video integration with attribution
367
+ - **Advanced Video Features** - Multi-layer composition, animations
368
+
369
+ ### πŸ“‹ Phase 3: Advanced Visualizations (Planned)
370
+ - Scientific animation generation (matplotlib, plotly)
371
+ - Real-time data visualization
372
+ - Mathematical formula rendering
373
+ - Dynamic graph and network visualizations
374
+
375
+ ### βœ… Phase 4: Knowledge Integration (Implemented - December 2025)
376
+ - **Knowledge Graph:** Fully operational with interactive visualization, 12,000+ papers indexed
377
+ - **Vector Search:** Semantic search implemented using Vertex AI embeddings
378
+ - **RAG System:** Retrieval-augmented generation with citations operational
379
+ - **Cross-Disciplinary Pattern Discovery:** Relationship extraction across papers, concepts, and categories
380
+ - **AI-Powered Content Recommendations:** Integrated into unified web dashboard
381
+ - **Live System:** https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine
382
 
383
+ ---
 
 
 
 
384
 
385
+ ## πŸ”¬ Collaborative Research Tools
386
 
387
+ **These platforms enable collective participation and collaboration across diverse user communities:**
 
388
 
389
+ - **Researchers** - Tools for hypothesis formation and testing, rapid synthesis of cross-disciplinary findings
390
+ - **Collaborators** - Collective knowledge exploration and refinement
391
+ - **Subscribers** - Prompt, generate, and share podcasts (public or private)
392
+ - **Community** - User suggestions, comments, and collaborative flowchart improvement (GLMP)
393
 
394
+ **Key Innovations:**
395
+ - **Multi-Source Validation** - Requires minimum 3 research sources per episode
396
+ - **Evidence-Based Generation** - No content generated without research backing
397
+ - **Paradigm Shift Detection** - Identifies revolutionary vs. incremental research
398
+ - **Interdisciplinary Connections** - Reveals cross-domain insights
399
+ - **Collaborative Participation** - User-driven content generation and sharing
400
+ - **Reproducibility** - Full citation tracking and source attribution
401
 
402
+ > *Like a microscope enables observation of the microscopic world, these tools enable observation and exploration of humanity's collective knowledge.*
403
 
404
+ ---
405
 
406
+ ## πŸ“š Prior Work & Research Contributions
 
 
407
 
408
+ ### Overview
409
+ This platform represents **prior work** that demonstrates foundational research and development achievements in AI-powered scientific knowledge synthesis, collaborative research tools, and multi-modal content generation. These contributions establish the technical foundation and proof-of-concept for the broader **CopernicusAI Knowledge Engine** initiative.
410
+
411
+ ### Research Contributions
412
+
413
+ **1. AI-Powered Research Synthesis System**
414
+ - Developed and deployed a production-ready system for multi-source research synthesis using LLMs
415
+ - Demonstrated integration of 8+ academic databases (250+ million papers) with parallel search capabilities
416
+ - Implemented evidence-based content generation requiring minimum 3 research sources per output
417
+ - Achieved operational deployment with 64+ generated podcast episodes across 5 scientific disciplines
418
+
419
+ **2. Multi-Model LLM Architecture**
420
+ - Designed and implemented intelligent model selection framework using Google Gemini 3, OpenAI GPT-4, and Anthropic Claude 3
421
+ - Developed fallback chains for reliability and cost optimization
422
+ - Demonstrated paradigm shift detection and interdisciplinary connection identification in research domains
423
+ - Implemented entity extraction (genes, proteins, chemical compounds, mathematical concepts) from research literature
424
+
425
+ **3. Collaborative Research Platform Infrastructure**
426
+ - Built subscriber-driven content generation system enabling public/private research sharing
427
+ - Implemented RSS feed distribution compatible with major podcast platforms
428
+ - Developed microservices architecture on Google Cloud Run with Firestore and Cloud Storage
429
+ - Created RESTful API framework for programmatic access to research synthesis capabilities
430
+
431
+ **4. Integration with Knowledge Engine Components**
432
+ - Established integration pathways with GLMP (Genome Logic Modeling Project) for biological process visualization
433
+ - Designed architecture for Research Papers Metadata Database (Phase 2)
434
+ - Planned integration with Science Video Database for multi-modal content
435
+ - Created framework for Programming Framework integration across disciplines
436
+
437
+ ### Technical Achievements
438
+
439
+ **Production Deployment:**
440
+ - Live platform: https://www.copernicusai.fyi
441
+ - Operational API: https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app
442
+ - RSS feed distribution: Active and functional
443
+ - Multi-voice audio synthesis: ElevenLabs TTS integration operational
444
+
445
+ **Research Infrastructure:**
446
+ - 250+ million research papers accessible via integrated APIs
447
+ - 8+ academic database integrations (PubMed, arXiv, NASA ADS, Zenodo, bioRxiv, CORE, Google Scholar, News API)
448
+ - **12,000+ mathematics papers indexed** with full metadata and vector embeddings in Knowledge Engine
449
+ - Automated citation extraction and formatting
450
+ - Quality scoring and relevance ranking systems
451
+ - **Knowledge Graph:** Fully operational with relationship extraction and interactive visualization
452
+ - **Vector Search:** Semantic search across papers, podcasts, and processes
453
+ - **RAG System:** Operational with citation support and multi-modal content integration
454
+
455
+ **Scalability & Architecture:**
456
+ - Serverless microservices architecture (Google Cloud Run)
457
+ - NoSQL database (Firestore) for flexible metadata storage
458
+ - Cloud Storage for media files and structured data
459
+ - Event-driven processing with Cloud Functions and Cloud Tasks
460
+
461
+ ### Position Within CopernicusAI Knowledge Engine
462
+
463
+ This platform serves as the **core synthesis and distribution component** of the CopernicusAI Knowledge Engine. The Knowledge Engine is an integrated ecosystem of research and collaboration tools that work together to assist scientists in their workflow, from research discovery through knowledge synthesis to multi-format content generation.
464
+
465
+ **Current Components:**
466
+ 1. **CopernicusAI** (This platform) - Core synthesis and distribution component for AI-powered research synthesis and podcast generation
467
+ 2. **Knowledge Engine Dashboard** (βœ… Implemented December 2025) - Fully operational web interface with knowledge graph visualization, vector search, RAG queries, content browsing, and statistics. Live at: https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine
468
+ 3. **Programming Framework** - Foundational meta-tool providing universal process analysis methodology
469
+ 4. **GLMP (Genome Logic Modeling Project)** - Specialized biological application demonstrating domain-specific use of the Programming Framework
470
+ 5. **Research Paper Metadata Database** - Core data infrastructure providing structured metadata and citation networks
471
+ 6. **Science Video Database** - Multi-modal content component enabling video-based learning and research discovery
472
+
473
+ **Future Development:**
474
+ The Knowledge Engine is designed to grow and evolve. Additional tools, databases, and collaboration components will be added as the project develops, expanding capabilities for AI-assisted scientific research and knowledge discovery.
475
+
476
+ ### Academic & Research Impact
477
+
478
+ **Publications & Presentations:**
479
+ - Platform architecture and methodology suitable for academic publication
480
+ - Open-source components available for research community use
481
+ - Publicly accessible research tools demonstrating AI-human collaboration in scientific knowledge synthesis
482
+
483
+ **Research Applications:**
484
+ - Supports hypothesis formation and testing through rapid multi-source synthesis
485
+ - Enables cross-disciplinary pattern recognition and connection identification
486
+ - Facilitates reproducible research communication with full citation tracking
487
+ - Provides infrastructure for collaborative knowledge exploration
488
+
489
+ **Educational Contributions:**
490
+ - 64+ research-driven podcast episodes across Biology, Chemistry, Computer Science, Mathematics, and Physics
491
+ - Evidence-based content requiring minimum 3 academic sources
492
+ - Public and private sharing options for research dissemination
493
+ - Integration with major podcast platforms for broad accessibility
494
+
495
+ ### Citation Information
496
+
497
+ **For Grant Proposals:**
498
+ When citing this work as prior research, please reference:
499
+
500
+ - **Platform Name:** CopernicusAI - Knowledge Engine for Scientific Discovery
501
+ - **URL:** https://huggingface.co/spaces/garywelz/copernicusai
502
+ - **Live Platform:** https://www.copernicusai.fyi
503
+ - **Primary Developer:** Gary Welz
504
+ - **Year:** 2024-2025
505
+ - **License:** MIT
506
+
507
+ **Suggested Citation Format:**
508
+ ```
509
+ Welz, G. (2025). CopernicusAI: Knowledge Engine for Scientific Discovery.
510
+ Hugging Face Space. https://huggingface.co/spaces/garywelz/copernicusai
511
+ ```
512
+
513
+ ## 🌐 Grant Support & Collaboration
514
+
515
+ **Grant Applications Supported:**
516
+ This platform is designed to support grant applications to:
517
+ - **NSF (National Science Foundation)** - Science education and research infrastructure
518
+ - **DOE (Department of Energy)** - Scientific computing and data science
519
+ - **SAIR Foundation** - AI research and development initiatives
520
+
521
+ **Research Contributions:**
522
+ - Open-source components and methodologies
523
+ - Publicly accessible research tools
524
+ - Educational content for broader scientific literacy
525
+ - Infrastructure for reproducible research communication
526
+
527
+ **Collaboration Opportunities:**
528
+ - Integration with academic institutions
529
+ - Partnership with research organizations
530
+ - Open data initiatives
531
+ - Educational program development
532
 
533
+ ---
534
 
535
+ ## How to Cite This Work
 
 
 
 
536
 
537
+ Welz, G. (2024–2025). *CopernicusAI: AI-Generated Audio Briefings as a Research Interface*.
538
+ Hugging Face Spaces. https://huggingface.co/spaces/garywelz/copernicusai
539
 
540
+ ---
541
 
542
+ ## πŸ“„ License & Attribution
 
 
 
 
543
 
544
+ **License:** MIT
545
 
546
+ **Attributions:**
547
+ - Built with Google Cloud Platform, Gemini AI, OpenAI, Anthropic Claude, and ElevenLabs
548
+ - Research data from PubMed, arXiv, NASA ADS, Zenodo, bioRxiv, CORE, and Google Scholar
549
+ - Academic paper metadata from respective publishers
550
 
551
  ---
552
 
553
+ ## πŸ“§ Contact & Support
554
+
555
+ For questions, collaboration inquiries, or grant application support:
556
+ - **Hugging Face Space:** [https://huggingface.co/spaces/garywelz/copernicusai](https://huggingface.co/spaces/garywelz/copernicusai)
557
+ - **Platform:** [https://www.copernicusai.fyi](https://www.copernicusai.fyi)
558
+
559
+ ---
560
 
561
+ **Β© 2025 CopernicusAI. All rights reserved.**
562
 
563
+ *Advancing scientific knowledge through AI-powered research communication and discovery.*
index.html CHANGED
@@ -3,12 +3,11 @@
3
  <head>
4
  <meta charset="UTF-8">
5
  <meta name="viewport" content="width=device-width, initial-scale=1.0">
6
- <title>The Programming Framework - Universal Process Analysis</title>
7
  <script src="https://cdn.tailwindcss.com"></script>
8
- <script src="https://cdn.jsdelivr.net/npm/mermaid/dist/mermaid.min.js"></script>
9
  <style>
10
  .gradient-bg {
11
- background: linear-gradient(135deg, #f59e0b 0%, #d97706 100%);
12
  }
13
  .card-hover {
14
  transition: transform 0.3s ease, box-shadow 0.3s ease;
@@ -17,6 +16,13 @@
17
  transform: translateY(-4px);
18
  box-shadow: 0 20px 40px rgba(0,0,0,0.15);
19
  }
 
 
 
 
 
 
 
20
  </style>
21
  </head>
22
  <body class="bg-gray-50">
@@ -24,12 +30,14 @@
24
  <header class="gradient-bg text-white">
25
  <div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-16">
26
  <div class="text-center">
27
- <div class="text-6xl mb-4">πŸ› οΈ</div>
28
- <h1 class="text-5xl font-bold mb-4">The Programming Framework</h1>
29
- <p class="text-xl opacity-90 mb-6">A Universal Method for Process Analysis</p>
30
- <p class="text-lg opacity-75 max-w-3xl mx-auto">
31
- Combining Large Language Models with Mermaid visualization to dissect and understand
32
- complex processes across any disciplineβ€”from biology to business, physics to psychology.
 
 
33
  </p>
34
  </div>
35
  </div>
@@ -37,428 +45,685 @@
37
 
38
  <!-- Abstract/Summary -->
39
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
40
- <div class="bg-white rounded-xl shadow-lg p-8 mb-8 border-l-4 border-orange-600">
41
  <h2 class="text-2xl font-bold text-gray-900 mb-4">πŸ“‹ Summary</h2>
42
  <p class="text-lg text-gray-700 leading-relaxed mb-3">
43
- The <strong>Programming Framework</strong> is a universal meta-tool for analyzing complex processes across any discipline by combining Large Language Models (LLMs) with visual flowchart representation. The Framework transforms textual process descriptions into structured, interactive Mermaid flowcharts stored as JSON, enabling systematic analysis, visualization, and integration with knowledge systems.
44
  </p>
45
  <p class="text-gray-600">
46
- Successfully demonstrated through GLMP (Genome Logic Modeling Project) with 50+ biological processes, and applied across Chemistry, Mathematics, Physics, and Computer Science. The Framework serves as the foundational methodology for the CopernicusAI Knowledge Engine, enabling domain-specific process visualization and analysis.
47
  </p>
48
  </div>
49
  </section>
50
 
51
- <!-- Prior Work & Research Contributions -->
52
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
53
- <div class="bg-gradient-to-r from-orange-50 to-yellow-50 rounded-xl shadow-lg p-8 mb-8">
54
- <h2 class="text-3xl font-bold text-gray-900 mb-6">πŸ“š Prior Work & Research Contributions</h2>
55
 
56
  <div class="bg-white rounded-lg p-6 mb-6">
57
- <h3 class="text-xl font-semibold text-gray-900 mb-4">Overview</h3>
 
 
 
58
  <p class="text-gray-700 mb-4">
59
- The Programming Framework represents <strong>prior work</strong> that demonstrates a novel methodology for analyzing complex processes by combining Large Language Models (LLMs) with visual flowchart representation. This research establishes a universal, domain-agnostic approach to process analysis that transforms textual descriptions into structured, interactive visualizations.
60
  </p>
61
- </div>
62
-
63
- <div class="grid md:grid-cols-2 gap-6 mb-6">
64
- <div class="bg-white rounded-lg p-6">
65
- <h3 class="text-lg font-semibold text-gray-900 mb-3">πŸ”¬ Research Contributions</h3>
66
- <ul class="text-sm text-gray-700 space-y-2">
67
- <li>β€’ <strong>Universal Process Analysis:</strong> Domain-agnostic methodology applicable across multiple fields</li>
68
- <li>β€’ <strong>LLM-Powered Extraction:</strong> Automated extraction using Google Gemini 2.0 Flash</li>
69
- <li>β€’ <strong>Structured Visualization:</strong> Mermaid.js-based flowchart generation encoded as JSON</li>
70
- <li>β€’ <strong>Iterative Refinement:</strong> Systematic approach enabling continuous improvement</li>
71
- </ul>
72
- </div>
73
-
74
- <div class="bg-white rounded-lg p-6">
75
- <h3 class="text-lg font-semibold text-gray-900 mb-3">βš™οΈ Technical Achievements</h3>
76
- <ul class="text-sm text-gray-700 space-y-2">
77
- <li>β€’ <strong>Meta-Tool Architecture:</strong> Framework for creating specialized analysis tools</li>
78
- <li>β€’ <strong>JSON-Based Storage:</strong> Structured format enabling version control and API integration</li>
79
- <li>β€’ <strong>Multi-Domain Application:</strong> Successfully applied to biological processes (GLMP)</li>
80
- <li>β€’ <strong>Integration Framework:</strong> Designed for knowledge engines and collaborative platforms</li>
81
  </ul>
82
  </div>
83
  </div>
84
-
85
- <div class="bg-white rounded-lg p-6">
86
- <h3 class="text-lg font-semibold text-gray-900 mb-3">🎯 Position Within CopernicusAI Knowledge Engine</h3>
87
  <p class="text-gray-700 mb-3">
88
- The Programming Framework serves as the <strong>foundational meta-tool</strong> of the CopernicusAI Knowledge Engine, providing the underlying methodology that enables specialized applications:
89
  </p>
90
- <div class="grid md:grid-cols-2 gap-4 text-sm mb-3">
91
- <ul class="text-gray-700 space-y-1">
92
- <li>β€’ GLMP (Genome Logic Modeling Project)</li>
93
- <li>β€’ Copernicus AI (main knowledge engine)</li>
94
- <li>β€’ Research Papers Metadata Database</li>
95
- </ul>
96
- <ul class="text-gray-700 space-y-1">
97
- <li>β€’ Science Video Database</li>
98
- <li>β€’ Multi-domain process analysis</li>
99
- </ul>
100
- </div>
101
- <p class="text-gray-600 text-sm italic">
102
- This work establishes a proof-of-concept for AI-assisted process analysis, demonstrating how LLMs can systematically extract and visualize complex logic from textual sources across diverse domains.
103
  </p>
104
  </div>
105
  </div>
106
  </section>
107
 
108
- <!-- Key Stats -->
109
- <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 -mt-8">
110
- <div class="grid md:grid-cols-4 gap-4">
111
- <div class="bg-white rounded-lg shadow-lg p-6 text-center">
112
- <div class="text-3xl font-bold text-orange-600">Any</div>
113
- <div class="text-sm text-gray-600">Discipline</div>
114
- </div>
115
- <div class="bg-white rounded-lg shadow-lg p-6 text-center">
116
- <div class="text-3xl font-bold text-blue-600">LLM</div>
117
- <div class="text-sm text-gray-600">Powered</div>
118
- </div>
119
- <div class="bg-white rounded-lg shadow-lg p-6 text-center">
120
- <div class="text-3xl font-bold text-green-600">Visual</div>
121
- <div class="text-sm text-gray-600">Flowcharts</div>
122
- </div>
123
- <div class="bg-white rounded-lg shadow-lg p-6 text-center">
124
- <div class="text-3xl font-bold text-purple-600">JSON</div>
125
- <div class="text-sm text-gray-600">Structured Data</div>
126
- </div>
127
  </div>
128
  </section>
129
 
130
- <!-- What is the Framework -->
131
- <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
132
- <div class="bg-white rounded-xl shadow-lg p-8">
133
- <h2 class="text-3xl font-bold text-gray-900 mb-6">🎯 What is the Programming Framework?</h2>
134
- <div class="prose max-w-none">
135
- <p class="text-lg text-gray-700 mb-4">
136
- The Programming Framework is a <strong>meta-tool</strong>β€”a tool for creating tools. It provides a
137
- systematic method for analyzing any complex process by combining the analytical power of Large Language
138
- Models with the clarity of visual flowcharts.
 
 
 
 
 
 
 
 
139
  </p>
140
-
141
- <div class="grid md:grid-cols-2 gap-6 mt-6">
142
- <div class="bg-orange-50 rounded-lg p-6">
143
- <h3 class="text-xl font-semibold text-gray-900 mb-3">πŸ” The Problem</h3>
144
- <p class="text-gray-700">
145
- Complex processesβ€”whether biological, computational, or organizationalβ€”are difficult to
146
- understand because they involve many steps, decision points, and interactions. Traditional
147
- descriptions in text are hard to follow.
148
- </p>
 
149
  </div>
150
-
151
- <div class="bg-blue-50 rounded-lg p-6">
152
- <h3 class="text-xl font-semibold text-gray-900 mb-3">✨ The Solution</h3>
153
- <p class="text-gray-700">
154
- Use LLMs to extract process logic from literature, then encode it as Mermaid flowcharts
155
- stored in JSON. Result: Clear, interactive visualizations that reveal hidden patterns and
156
- enable systematic analysis.
 
 
 
 
157
  </p>
158
  </div>
159
  </div>
160
  </div>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
161
  </div>
162
  </section>
163
 
164
- <!-- How It Works -->
165
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
166
- <div class="bg-gradient-to-r from-orange-50 to-yellow-50 rounded-xl p-8">
167
- <h2 class="text-3xl font-bold text-gray-900 mb-8 text-center">βš™οΈ How It Works</h2>
168
-
169
- <div class="grid md:grid-cols-4 gap-6">
170
- <div class="bg-white rounded-lg p-6 text-center">
171
- <div class="text-4xl mb-3">1️⃣</div>
172
- <h3 class="font-semibold text-gray-900 mb-2">Input Process</h3>
173
- <p class="text-sm text-gray-600">Provide scientific papers, documentation, or process descriptions</p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
174
  </div>
 
175
 
176
- <div class="bg-white rounded-lg p-6 text-center">
177
- <div class="text-4xl mb-3">2️⃣</div>
178
- <h3 class="font-semibold text-gray-900 mb-2">LLM Analysis</h3>
179
- <p class="text-sm text-gray-600">AI extracts steps, decisions, branches, and logic flow</p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
180
  </div>
 
181
 
182
- <div class="bg-white rounded-lg p-6 text-center">
183
- <div class="text-4xl mb-3">3️⃣</div>
184
- <h3 class="font-semibold text-gray-900 mb-2">Generate Flowchart</h3>
185
- <p class="text-sm text-gray-600">Create Mermaid diagram encoded as JSON structure</p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
186
  </div>
 
187
 
188
- <div class="bg-white rounded-lg p-6 text-center">
189
- <div class="text-4xl mb-3">4️⃣</div>
190
- <h3 class="font-semibold text-gray-900 mb-2">Visualize & Iterate</h3>
191
- <p class="text-sm text-gray-600">Interactive flowchart reveals insights and enables refinement</p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
192
  </div>
193
  </div>
194
 
195
- <div class="mt-8 bg-white rounded-lg p-6">
196
- <h4 class="font-semibold text-gray-900 mb-3">πŸ“Š Live Example:</h4>
197
- <div id="exampleDiagram" class="mermaid">
198
- graph TD
199
- A[Complex Process Input] --> B{LLM Analysis}
200
- B -->|Extract Logic| C[Identify Steps]
201
- B -->|Extract Decisions| D[Identify Branches]
202
- C --> E[Create Flowchart Nodes]
203
- D --> F[Create Decision Points]
204
- E --> G[Generate Mermaid Syntax]
205
- F --> G
206
- G --> H[Store as JSON]
207
- H --> I[Interactive Visualization]
208
- I --> J{Insights Gained?}
209
- J -->|No| K[Refine Analysis]
210
- J -->|Yes| L[Apply Knowledge]
211
- K --> B
212
-
213
- style A fill:#ff6b6b,color:#fff
214
- style B fill:#74c0fc,color:#fff
215
- style C fill:#51cf66,color:#fff
216
- style D fill:#51cf66,color:#fff
217
- style E fill:#ffd43b,color:#000
218
- style F fill:#ffd43b,color:#000
219
- style G fill:#51cf66,color:#fff
220
- style H fill:#74c0fc,color:#fff
221
- style I fill:#74c0fc,color:#fff
222
- style J fill:#74c0fc,color:#fff
223
- style K fill:#51cf66,color:#fff
224
- style L fill:#b197fc,color:#fff
225
- </div>
226
- <div class="mt-4 text-sm text-gray-600">
227
- <p class="font-semibold mb-2">Color Legend:</p>
228
- <div class="flex flex-wrap gap-4">
229
- <span><span class="inline-block w-4 h-4 rounded" style="background-color: #ff6b6b;"></span> Red - Triggers & Inputs</span>
230
- <span><span class="inline-block w-4 h-4 rounded" style="background-color: #ffd43b;"></span> Yellow - Structures & Objects</span>
231
- <span><span class="inline-block w-4 h-4 rounded" style="background-color: #51cf66;"></span> Green - Processing & Operations</span>
232
- <span><span class="inline-block w-4 h-4 rounded" style="background-color: #74c0fc;"></span> Blue - Intermediates & States</span>
233
- <span><span class="inline-block w-4 h-4 rounded" style="background-color: #b197fc;"></span> Violet - Products & Outputs</span>
234
  </div>
235
  </div>
236
  </div>
237
  </div>
238
  </section>
239
 
240
- <!-- Core Principles -->
241
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
242
- <h2 class="text-3xl font-bold text-gray-900 mb-6">πŸ’‘ Core Principles</h2>
243
-
244
- <div class="grid md:grid-cols-3 gap-6">
245
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
246
- <div class="text-4xl mb-3">🌍</div>
247
- <h3 class="text-xl font-semibold text-gray-900 mb-3">Domain Agnostic</h3>
248
- <p class="text-gray-600">
249
- Works across any field: biology, chemistry, software engineering, business processes,
250
- legal workflows, manufacturing, and beyond.
251
- </p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
252
  </div>
253
-
254
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
255
- <div class="text-4xl mb-3">πŸ”„</div>
256
- <h3 class="text-xl font-semibold text-gray-900 mb-3">Iterative Refinement</h3>
257
- <p class="text-gray-600">
258
- Start with rough analysis, visualize, identify gaps, refine with LLM, repeat until
259
- the process logic is crystal clear.
260
  </p>
 
 
 
 
 
 
 
261
  </div>
262
-
263
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
264
- <div class="text-4xl mb-3">πŸ“¦</div>
265
- <h3 class="text-xl font-semibold text-gray-900 mb-3">Structured Data</h3>
266
- <p class="text-gray-600">
267
- JSON storage enables programmatic access, version control, cross-referencing,
268
- and integration with other tools and databases.
269
  </p>
 
 
 
 
 
 
 
270
  </div>
271
  </div>
272
  </section>
273
 
274
-
275
- <!-- Process Diagram Collections -->
276
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
277
- <div class="bg-gradient-to-r from-green-50 to-blue-50 rounded-xl shadow-lg p-8">
278
- <h2 class="text-3xl font-bold text-gray-900 mb-6">πŸ“š Process Diagram Collections</h2>
279
- <p class="text-gray-700 mb-6">
280
- The Programming Framework has been applied across multiple scientific disciplines. Explore interactive flowchart collections organized by domain:
281
- </p>
282
 
283
- <div class="grid md:grid-cols-2 lg:grid-cols-3 gap-6">
284
- <!-- Biology -->
285
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
286
- <h3 class="text-xl font-semibold text-gray-900 mb-4 flex items-center">
287
- <span class="text-2xl mr-2">🧬</span> Biology
288
- </h3>
289
- <p class="text-gray-600 text-sm mb-3">
290
- Biological process visualizations: GLMP covers biochemical/molecular processes; Biology Database covers higher-level organismal processes.
291
- </p>
292
- <div class="space-y-2">
293
- <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/biology-processes-database/biology-database-table.html"
294
- class="text-orange-600 hover:text-orange-700 font-medium text-sm inline-block mr-4"
295
- target="_blank" rel="noopener noreferrer">
296
- πŸ—„οΈ Biology Processes Database β†’
297
- </a>
298
- <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/glmp-database-table.html"
299
- class="text-orange-600 hover:text-orange-700 font-medium text-sm inline-block"
300
- target="_blank" rel="noopener noreferrer">
301
- πŸ—„οΈ GLMP Database Table β†’
302
- </a>
303
- </div>
304
- <p class="text-xs text-gray-500 mt-2">
305
- Biology Database: 52 processes (organismal/ecological) | GLMP: 50+ processes (biochemical/molecular)
306
- </p>
307
- </div>
308
-
309
- <!-- Chemistry -->
310
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
311
- <h3 class="text-xl font-semibold text-gray-900 mb-4 flex items-center">
312
- <span class="text-2xl mr-2">βš—οΈ</span> Chemistry
313
- </h3>
314
- <p class="text-gray-600 text-sm mb-3">
315
- Comprehensive chemistry process diagrams across all major branches.
316
- </p>
317
- <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/chemistry-processes-database/chemistry-database-table.html"
318
- class="text-orange-600 hover:text-orange-700 font-medium text-sm inline-block"
319
- target="_blank" rel="noopener noreferrer">
320
- πŸ—„οΈ Chemistry Database Table β†’
321
- </a>
322
- <p class="text-xs text-gray-500 mt-2">
323
- 56 processes across 14 subcategories
324
- </p>
325
- </div>
326
-
327
- <!-- Mathematics -->
328
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
329
- <h3 class="text-xl font-semibold text-gray-900 mb-4 flex items-center">
330
- <span class="text-2xl mr-2">πŸ”’</span> Mathematics
331
- </h3>
332
- <p class="text-gray-600 text-sm mb-3">
333
- Mathematical algorithms, proof methods, and computational processes.
334
- </p>
335
- <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/mathematics-processes-database/mathematics-database-table.html"
336
- class="text-orange-600 hover:text-orange-700 font-medium text-sm inline-block"
337
- target="_blank" rel="noopener noreferrer">
338
- πŸ—„οΈ Mathematics Database Table β†’
339
- </a>
340
- <p class="text-xs text-gray-500 mt-2">
341
- 20 processes across 7 subcategories
342
- </p>
343
  </div>
344
-
345
- <!-- Physics -->
346
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
347
- <h3 class="text-xl font-semibold text-gray-900 mb-4 flex items-center">
348
- <span class="text-2xl mr-2">βš›οΈ</span> Physics
349
- </h3>
350
- <p class="text-gray-600 text-sm mb-3">
351
- Physical processes including quantum mechanics, thermodynamics, and particle physics.
352
- </p>
353
- <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/physics-processes-database/physics-database-table.html"
354
- class="text-orange-600 hover:text-orange-700 font-medium text-sm inline-block"
355
- target="_blank" rel="noopener noreferrer">
356
- πŸ—„οΈ Physics Database Table β†’
357
- </a>
358
- <p class="text-xs text-gray-500 mt-2">
359
- 21 processes across 7 subcategories
360
- </p>
361
  </div>
362
-
363
- <!-- Computer Science -->
364
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
365
- <h3 class="text-xl font-semibold text-gray-900 mb-4 flex items-center">
366
- <span class="text-2xl mr-2">πŸ’»</span> Computer Science
367
- </h3>
368
- <p class="text-gray-600 text-sm mb-3">
369
- Algorithms, software engineering workflows, and computational processes.
370
- </p>
371
- <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/computer-science-processes-database/computer-science-database-table.html"
372
- class="text-orange-600 hover:text-orange-700 font-medium text-sm inline-block"
373
- target="_blank" rel="noopener noreferrer">
374
- πŸ—„οΈ Computer Science Database Table β†’
375
- </a>
376
- <p class="text-xs text-gray-500 mt-2">
377
- 21 processes across 7 subcategories
378
- </p>
379
  </div>
380
  </div>
381
  </div>
382
  </section>
383
 
384
- <!-- Technical Details -->
385
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
386
- <div class="bg-gray-900 text-white rounded-xl p-8">
387
- <h2 class="text-3xl font-bold mb-6">βš™οΈ Technical Architecture</h2>
388
 
389
- <div class="grid md:grid-cols-2 gap-8">
390
- <div>
391
- <h3 class="text-xl font-semibold text-orange-400 mb-4">πŸ€– LLM Integration</h3>
392
- <ul class="space-y-2 text-gray-300">
393
- <li>β€’ Google Gemini 2.0 Flash for analysis</li>
394
- <li>β€’ Vertex AI for enterprise deployment</li>
395
- <li>β€’ Custom prompts for process extraction</li>
396
- <li>β€’ Structured JSON output formatting</li>
 
397
  </ul>
398
  </div>
399
-
400
- <div>
401
- <h3 class="text-xl font-semibold text-orange-400 mb-4">πŸ“Š Visualization Stack</h3>
402
- <ul class="space-y-2 text-gray-300">
403
- <li>β€’ Mermaid.js for flowchart rendering</li>
404
- <li>β€’ JSON schema for data validation</li>
405
- <li>β€’ Interactive SVG output</li>
406
- <li>β€’ Export to PNG/PDF supported</li>
 
 
407
  </ul>
408
  </div>
 
 
 
409
 
 
 
 
 
 
 
410
  <div>
411
- <h3 class="text-xl font-semibold text-orange-400 mb-4">πŸ’Ύ Data Storage</h3>
412
- <ul class="space-y-2 text-gray-300">
413
- <li>β€’ Google Cloud Storage for JSON files</li>
414
- <li>β€’ Firestore for metadata indexing</li>
415
- <li>β€’ Version control with Git</li>
416
- <li>β€’ Cross-referencing with papers database</li>
 
 
 
417
  </ul>
 
 
 
 
418
  </div>
419
-
420
  <div>
421
- <h3 class="text-xl font-semibold text-orange-400 mb-4">πŸ”— Integration Points</h3>
422
- <ul class="space-y-2 text-gray-300">
423
- <li>β€’ GLMP specialized collections</li>
424
- <li>β€’ Copernicus AI knowledge graph</li>
425
- <li>β€’ Research papers database</li>
426
- <li>β€’ API endpoints for programmatic access</li>
 
 
427
  </ul>
428
  </div>
429
  </div>
430
  </div>
431
  </section>
432
 
433
- <!-- Related Projects -->
434
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
435
- <h2 class="text-3xl font-bold text-gray-900 mb-6 text-center">πŸ”— Related Projects</h2>
436
-
437
- <div class="grid md:grid-cols-2 gap-6">
438
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
439
- <h3 class="text-xl font-semibold text-gray-900 mb-3">🧬 GLMP - Genome Logic Modeling</h3>
440
- <p class="text-gray-600 mb-4">
441
- First specialized application of the Programming Framework to biochemical processes.
442
- 100+ biological pathways visualized as interactive flowcharts.
 
 
443
  </p>
444
- <a href="https://huggingface.co/spaces/garywelz/glmp"
445
- class="text-orange-600 hover:text-orange-700 font-semibold"
446
- target="_blank" rel="noopener noreferrer">
447
- Explore GLMP β†’ (opens in new tab)
448
- </a>
449
  </div>
450
 
451
- <div class="bg-white rounded-lg shadow-md p-6 card-hover">
452
- <h3 class="text-xl font-semibold text-gray-900 mb-3">πŸ”¬ Copernicus AI</h3>
453
- <p class="text-gray-600 mb-4">
454
- Knowledge engine integrating the Programming Framework with AI podcasts, research papers,
455
- and knowledge graph for scientific discovery.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
456
  </p>
457
- <a href="https://www.copernicusai.fyi"
458
- class="text-orange-600 hover:text-orange-700 font-semibold"
459
- target="_blank" rel="noopener noreferrer">
460
- Visit Copernicus AI β†’ (opens in new tab)
461
- </a>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
462
  </div>
463
  </div>
464
  </section>
@@ -467,37 +732,188 @@
467
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
468
  <div class="bg-white rounded-xl shadow-lg p-8">
469
  <h2 class="text-3xl font-bold text-gray-900 mb-6">How to Cite This Work</h2>
470
- <div class="bg-gray-50 rounded-lg p-6 mb-4 space-y-4">
471
- <div>
472
- <p class="text-gray-800 font-mono text-lg leading-relaxed mb-3">
473
- Welz, G. (2024–2025). <em>The Programming Framework: A Universal Method for Process Analysis</em>.<br>
474
- Hugging Face Spaces. <a href="https://huggingface.co/spaces/garywelz/programming_framework" class="text-orange-600 hover:text-orange-700 underline" target="_blank" rel="noopener noreferrer">https://huggingface.co/spaces/garywelz/programming_framework</a> (opens in new tab)
475
- </p>
476
- <div class="bg-white rounded p-3 mt-3">
477
- <p class="text-sm font-semibold text-gray-700 mb-2">BibTeX Format:</p>
478
- <pre class="bg-gray-900 text-orange-400 p-3 rounded text-xs overflow-x-auto"><code>@misc{welz2025programmingframework,
479
- title={The Programming Framework: A Universal Method for Process Analysis},
480
  author={Welz, Gary},
481
  year={2024--2025},
482
- url={https://huggingface.co/spaces/garywelz/programming_framework},
483
- note={Hugging Face Spaces}
484
  }</code></pre>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
485
  </div>
486
  </div>
487
- <div class="border-t border-gray-300 pt-4">
488
- <p class="text-gray-800 font-mono text-lg leading-relaxed">
489
- Welz, G. (2024). <em>From Inspiration to AI: Biology as Visual Programming</em>.<br>
490
- Medium. <a href="https://medium.com/@garywelz_47126/from-inspiration-to-ai-biology-as-visual-programming-520ee523029a" class="text-orange-600 hover:text-orange-700 underline" target="_blank" rel="noopener noreferrer">https://medium.com/@garywelz_47126/from-inspiration-to-ai-biology-as-visual-programming-520ee523029a</a> (opens in new tab)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
491
  </p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
492
  </div>
493
  </div>
494
- <div class="bg-orange-50 rounded-lg p-4">
495
- <p class="text-gray-700 mb-2">
496
- This project serves as a foundational meta-tool for AI-assisted process analysis, enabling systematic extraction and visualization of complex logic from textual sources across diverse scientific and technical domains.
497
- </p>
498
- <p class="text-gray-700 font-semibold">
499
- The Programming Framework is designed as infrastructure for AI-assisted science, providing a universal methodology that can be specialized for domain-specific applications.
500
- </p>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
501
  </div>
502
  </div>
503
  </section>
@@ -505,15 +921,10 @@
505
  <!-- Footer -->
506
  <footer class="gradient-bg text-white py-8 mt-12">
507
  <div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 text-center">
508
- <p class="text-lg font-semibold mb-2">The Programming Framework</p>
509
- <p class="text-sm opacity-75">A Universal Method for Process Analysis</p>
510
- <p class="text-xs opacity-50 mt-4">&copy; 2025 Gary Welz. All rights reserved.</p>
511
  </div>
512
  </footer>
513
-
514
- <script>
515
- mermaid.initialize({ startOnLoad: true, theme: 'default' });
516
- </script>
517
  </body>
518
  </html>
519
-
 
3
  <head>
4
  <meta charset="UTF-8">
5
  <meta name="viewport" content="width=device-width, initial-scale=1.0">
6
+ <title>CopernicusAI - Research-Driven Podcast Generation Platform</title>
7
  <script src="https://cdn.tailwindcss.com"></script>
 
8
  <style>
9
  .gradient-bg {
10
+ background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
11
  }
12
  .card-hover {
13
  transition: transform 0.3s ease, box-shadow 0.3s ease;
 
16
  transform: translateY(-4px);
17
  box-shadow: 0 20px 40px rgba(0,0,0,0.15);
18
  }
19
+ .stat-number {
20
+ font-size: 2.5rem;
21
+ font-weight: bold;
22
+ background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
23
+ -webkit-background-clip: text;
24
+ -webkit-text-fill-color: transparent;
25
+ }
26
  </style>
27
  </head>
28
  <body class="bg-gray-50">
 
30
  <header class="gradient-bg text-white">
31
  <div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-16">
32
  <div class="text-center">
33
+ <div class="text-6xl mb-4">πŸ”¬</div>
34
+ <h1 class="text-5xl font-bold mb-4">CopernicusAI</h1>
35
+ <p class="text-xl opacity-90 mb-6">Knowledge Engine for Scientific Discovery</p>
36
+ <p class="text-lg opacity-75 max-w-4xl mx-auto">
37
+ A collaborative research platform that transforms cutting-edge scientific research into accessible,
38
+ multi-format tools for collective knowledge exploration. These are research instrumentsβ€”like microscopes
39
+ for observing the collective knowledge of humanityβ€”enabling hypothesis formation, testing, and discovery
40
+ across scientific disciplines.
41
  </p>
42
  </div>
43
  </div>
 
45
 
46
  <!-- Abstract/Summary -->
47
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
48
+ <div class="bg-white rounded-xl shadow-lg p-8 mb-8 border-l-4 border-purple-600">
49
  <h2 class="text-2xl font-bold text-gray-900 mb-4">πŸ“‹ Summary</h2>
50
  <p class="text-lg text-gray-700 leading-relaxed mb-3">
51
+ <strong>CopernicusAI</strong> is an operational research platform that synthesizes scientific literature from 250+ million papers into AI-generated podcasts, integrates with a knowledge graph of 12,000+ indexed papers, and provides collaborative tools for research discovery. The system demonstrates production-ready multi-source research synthesis with full citation tracking and evidence-based content generation requiring minimum 3 research sources per episode.
52
  </p>
53
  <p class="text-gray-600">
54
+ The platform includes a fully operational Knowledge Engine Dashboard (deployed December 2025) with interactive knowledge graph visualization, vector search, and RAG capabilities, enabling researchers to explore, query, and synthesize scientific knowledge across disciplines.
55
  </p>
56
  </div>
57
  </section>
58
 
59
+ <!-- Prior Work: CopernicusAI Research Interface -->
60
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
61
+ <div class="bg-gradient-to-r from-purple-50 to-blue-50 rounded-xl shadow-lg p-8 mb-8">
62
+ <h2 class="text-3xl font-bold text-gray-900 mb-4">Prior Work & Current Status</h2>
63
 
64
  <div class="bg-white rounded-lg p-6 mb-6">
65
+ <h3 class="text-xl font-semibold text-gray-900 mb-3">Prior Work (2024-2025)</h3>
66
+ <p class="text-lg text-gray-700 leading-relaxed mb-4">
67
+ CopernicusAI is an active research prototype exploring AI-generated audio briefings as an interface for assisted scientific research.
68
+ </p>
69
  <p class="text-gray-700 mb-4">
70
+ The system allows any user to generate, refine, and share AI-generated science podcasts based on structured prompts, enabling rapid orientation to a topic, iterative deepening, and personalized research briefings.
71
  </p>
72
+ <p class="text-gray-700 mb-4">
73
+ Rather than functioning as a static content platform, CopernicusAI supports collectively generated and shared research artifacts, analogous to community-driven knowledge platforms (e.g., discussion forums), but grounded in scientific sources and metadata-aware workflows.
74
+ </p>
75
+ <div class="bg-blue-50 rounded-lg p-4 mt-4">
76
+ <h3 class="font-semibold text-gray-900 mb-2">This work demonstrates technical feasibility for:</h3>
77
+ <ul class="text-gray-700 space-y-1">
78
+ <li>β€’ AI-assisted research briefing and orientation</li>
79
+ <li>β€’ Iterative question refinement via conversational interfaces</li>
80
+ <li>β€’ Integration of text, audio, and metadata in research workflows</li>
 
 
 
 
 
 
 
 
 
 
 
81
  </ul>
82
  </div>
83
  </div>
84
+
85
+ <div class="bg-green-50 border-2 border-green-200 rounded-lg p-6">
86
+ <h3 class="text-xl font-semibold text-gray-900 mb-3">Current Implementation (December 2025)</h3>
87
  <p class="text-gray-700 mb-3">
88
+ The Knowledge Engine Dashboard is <strong>fully operational</strong> and deployed to Google Cloud Run, providing unified access to all components with interactive knowledge graph visualization, vector search, RAG queries, and content browsing.
89
  </p>
90
+ <p class="text-sm text-gray-600">
91
+ See the "Knowledge Engine Ecosystem" section below for details.
 
 
 
 
 
 
 
 
 
 
 
92
  </p>
93
  </div>
94
  </div>
95
  </section>
96
 
97
+ <!-- Mission & Vision -->
98
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
99
+ <div class="bg-white rounded-xl shadow-lg p-8 mb-8">
100
+ <h2 class="text-3xl font-bold text-gray-900 mb-4">🎯 Mission & Vision</h2>
101
+ <p class="text-lg text-gray-700 leading-relaxed mb-4">
102
+ Inspired by Nicolaus Copernicus who challenged accepted knowledge with evidence and rigorous analysis,
103
+ <strong>CopernicusAI</strong> creates collaborative research tools that enable collective participation in
104
+ scientific discovery. These platforms are instruments for exploring humanity's collective knowledgeβ€”tools for
105
+ hypothesis formation, testing, and collaborative research, not just educational content.
106
+ </p>
107
+ <p class="text-gray-600">
108
+ Just as a microscope enables observation of the microscopic world, CopernicusAI tools enable observation and
109
+ exploration of humanity's collective knowledge. Subscribers collaborate to prompt, generate, and refine research
110
+ contentβ€”sharing discoveries publicly or keeping them private. As large language models (LLMs) and AI systems
111
+ gain unprecedented knowledge, CopernicusAI provides the infrastructure for human-AI collaborative knowledge
112
+ exploration, with evidence-based truth-seeking as our guiding principle.
113
+ </p>
 
 
114
  </div>
115
  </section>
116
 
117
+ <!-- Knowledge Engine Ecosystem -->
118
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
119
+ <div class="bg-gradient-to-r from-purple-50 to-indigo-50 rounded-xl shadow-lg p-8 mb-8">
120
+ <h2 class="text-3xl font-bold text-gray-900 mb-6 text-center">🧩 CopernicusAI Knowledge Engine</h2>
121
+ <p class="text-lg text-gray-700 mb-6 text-center max-w-4xl mx-auto">
122
+ An integrated ecosystem of research and collaboration tools designed to assist scientists in their workflow,
123
+ from research discovery through knowledge synthesis to multi-format content generation.
124
+ </p>
125
+
126
+ <!-- Knowledge Engine Implementation Status -->
127
+ <div class="bg-green-50 border-2 border-green-200 rounded-xl p-6 mb-6">
128
+ <div class="flex items-center mb-3">
129
+ <span class="text-3xl mr-3">βœ…</span>
130
+ <h3 class="text-xl font-bold text-gray-900">Knowledge Engine Dashboard - Fully Operational (December 2025)</h3>
131
+ </div>
132
+ <p class="text-gray-700 mb-4">
133
+ The Knowledge Engine is now <strong>fully implemented and deployed</strong> with a working web dashboard providing unified access to all components.
134
  </p>
135
+ <div class="grid md:grid-cols-2 gap-4 mb-4">
136
+ <div>
137
+ <h4 class="font-semibold text-gray-800 mb-2">Key Features:</h4>
138
+ <ul class="text-sm text-gray-700 space-y-1">
139
+ <li>βœ“ Interactive Knowledge Graph (12,000+ papers)</li>
140
+ <li>βœ“ Vector Search (semantic similarity)</li>
141
+ <li>βœ“ RAG System (with citations)</li>
142
+ <li>βœ“ Content Browsing (papers, podcasts, processes)</li>
143
+ <li>βœ“ Statistics Dashboard</li>
144
+ </ul>
145
  </div>
146
+ <div>
147
+ <h4 class="font-semibold text-gray-800 mb-2">Live System:</h4>
148
+ <p class="text-sm text-gray-700 mb-2">
149
+ <a href="https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine"
150
+ target="_blank" rel="noopener noreferrer"
151
+ class="text-blue-600 hover:underline font-medium">
152
+ https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine (opens in new tab)
153
+ </a>
154
+ </p>
155
+ <p class="text-xs text-gray-600">
156
+ Fully deployed to Google Cloud Run, accessible 24/7
157
  </p>
158
  </div>
159
  </div>
160
  </div>
161
+
162
+ <div class="grid md:grid-cols-2 lg:grid-cols-3 gap-6 mb-6">
163
+ <div class="bg-white rounded-lg p-6 card-hover">
164
+ <div class="text-3xl mb-3">πŸ”¬</div>
165
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">CopernicusAI</h3>
166
+ <p class="text-sm text-gray-600 mb-3">Core synthesis & distribution platform for AI-powered research and podcast generation</p>
167
+ <p class="text-xs text-gray-500 italic">This platform</p>
168
+ </div>
169
+
170
+ <div class="bg-white rounded-lg p-6 card-hover border-2 border-green-300">
171
+ <div class="text-3xl mb-3">πŸ—ΊοΈ</div>
172
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">Knowledge Engine Dashboard</h3>
173
+ <p class="text-sm text-gray-600 mb-3">βœ… Fully operational web interface with knowledge graph, vector search, RAG queries, and content browsing</p>
174
+ <a href="https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine" target="_blank" class="text-xs text-blue-600 hover:underline">Live System β†’</a>
175
+ </div>
176
+
177
+ <div class="bg-white rounded-lg p-6 card-hover">
178
+ <div class="text-3xl mb-3">πŸ› οΈ</div>
179
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">Programming Framework</h3>
180
+ <p class="text-sm text-gray-600 mb-3">Foundational meta-tool for universal process analysis across disciplines</p>
181
+ <a href="https://huggingface.co/spaces/garywelz/programming_framework" target="_blank" class="text-xs text-blue-600 hover:underline">Explore β†’</a>
182
+ </div>
183
+
184
+ <div class="bg-white rounded-lg p-6 card-hover">
185
+ <div class="text-3xl mb-3">🧬</div>
186
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">GLMP</h3>
187
+ <p class="text-sm text-gray-600 mb-3">Biological process visualization - 50+ processes mapped</p>
188
+ <a href="https://huggingface.co/spaces/garywelz/glmp" target="_blank" class="text-xs text-blue-600 hover:underline">Explore β†’</a>
189
+ </div>
190
+
191
+ <div class="bg-white rounded-lg p-6 card-hover">
192
+ <div class="text-3xl mb-3">πŸ“š</div>
193
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">Metadata Database</h3>
194
+ <p class="text-sm text-gray-600 mb-3">Core data infrastructure for research paper metadata and citation networks</p>
195
+ <a href="https://huggingface.co/spaces/garywelz/metadata_database" target="_blank" class="text-xs text-blue-600 hover:underline">Explore β†’</a>
196
+ </div>
197
+
198
+ <div class="bg-white rounded-lg p-6 card-hover">
199
+ <div class="text-3xl mb-3">🎬</div>
200
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">Video Database</h3>
201
+ <p class="text-sm text-gray-600 mb-3">Multi-modal content with transcript-based search for scientific videos</p>
202
+ <a href="https://huggingface.co/spaces/garywelz/sciencevideodb" target="_blank" class="text-xs text-blue-600 hover:underline">Explore β†’</a>
203
+ </div>
204
+
205
+ <div class="bg-white rounded-lg p-6 card-hover border-2 border-dashed border-gray-300">
206
+ <div class="text-3xl mb-3 opacity-50">βž•</div>
207
+ <h3 class="text-lg font-semibold text-gray-900 mb-2">Future Components</h3>
208
+ <p class="text-sm text-gray-600 mb-3">Additional tools, databases, and collaboration features will be added as the project develops</p>
209
+ </div>
210
+ </div>
211
  </div>
212
  </section>
213
 
214
+ <!-- Key Statistics -->
215
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
216
+ <div class="grid md:grid-cols-2 gap-6 mb-12">
217
+ <div class="bg-white rounded-lg shadow-md p-6 text-center">
218
+ <div class="stat-number mb-2">250+</div>
219
+ <div class="text-gray-600 font-semibold">Million Papers</div>
220
+ <div class="text-sm text-gray-500 mt-1">Accessible via APIs (As of January 2025)</div>
221
+ </div>
222
+ <div class="bg-white rounded-lg shadow-md p-6 text-center">
223
+ <div class="stat-number mb-2">12,000+</div>
224
+ <div class="text-gray-600 font-semibold">Indexed Papers</div>
225
+ <div class="text-sm text-gray-500 mt-1">In Knowledge Engine - Mathematics (As of December 2025)</div>
226
+ </div>
227
+ <div class="bg-white rounded-lg shadow-md p-6 text-center">
228
+ <div class="stat-number mb-2">64+</div>
229
+ <div class="text-gray-600 font-semibold">Podcast Episodes</div>
230
+ <div class="text-sm text-gray-500 mt-1">Generated across 5 disciplines (As of January 2025)</div>
231
+ </div>
232
+ <div class="bg-white rounded-lg shadow-md p-6 text-center">
233
+ <div class="stat-number mb-2">8+</div>
234
+ <div class="text-gray-600 font-semibold">Academic Databases</div>
235
+ <div class="text-sm text-gray-500 mt-1">Integrated research sources (As of January 2025)</div>
236
+ </div>
237
+ </div>
238
+ </section>
239
+
240
+ <!-- Core Platform Capabilities -->
241
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
242
+ <h2 class="text-3xl font-bold text-gray-900 mb-8 text-center">🌟 Core Platform Capabilities</h2>
243
+
244
+ <div class="space-y-8">
245
+ <!-- AI Podcast Generation -->
246
+ <div class="bg-white rounded-xl shadow-lg p-8">
247
+ <div class="flex items-start mb-4">
248
+ <span class="text-4xl mr-4">πŸŽ™οΈ</span>
249
+ <div class="flex-1">
250
+ <h3 class="text-2xl font-bold text-gray-900 mb-3">AI-Powered Podcast Generation</h3>
251
+ <p class="text-gray-600 mb-4">
252
+ Collaborative research platform where subscribers prompt and generate multi-voice AI podcasts
253
+ (5-10 minutes) synthesizing research from multiple academic sources. Subscribers can share their
254
+ podcasts publicly or keep them private. Evidence-based content generation requiring minimum 3
255
+ research sources per episode.
256
+ </p>
257
+ <div class="grid md:grid-cols-2 gap-4 mt-4">
258
+ <div>
259
+ <h4 class="font-semibold text-gray-800 mb-2">Key Features:</h4>
260
+ <ul class="text-sm text-gray-600 space-y-1">
261
+ <li>βœ“ Comprehensive research integration (8+ databases)</li>
262
+ <li>βœ“ Professional multi-speaker dialogue</li>
263
+ <li>βœ“ AI-generated scientific visualizations</li>
264
+ <li>βœ“ RSS feed distribution</li>
265
+ <li>βœ“ Quality scoring & relevance ranking</li>
266
+ <li>βœ“ Paradigm shift identification</li>
267
+ </ul>
268
+ </div>
269
+ <div>
270
+ <h4 class="font-semibold text-gray-800 mb-2">Research Integration:</h4>
271
+ <ul class="text-sm text-gray-600 space-y-1">
272
+ <li>βœ“ Real-time discovery from 8+ APIs</li>
273
+ <li>βœ“ Parallel search across databases</li>
274
+ <li>βœ“ Automatic citation extraction</li>
275
+ <li>βœ“ Source validation & verification</li>
276
+ <li>βœ“ Interdisciplinary connection analysis</li>
277
+ </ul>
278
+ </div>
279
+ </div>
280
+ </div>
281
  </div>
282
+ </div>
283
 
284
+ <!-- LLM Integration -->
285
+ <div class="bg-white rounded-xl shadow-lg p-8">
286
+ <div class="flex items-start mb-4">
287
+ <span class="text-4xl mr-4">πŸ€–</span>
288
+ <div class="flex-1">
289
+ <h3 class="text-2xl font-bold text-gray-900 mb-3">Advanced LLM Integration</h3>
290
+ <p class="text-gray-600 mb-4">Multi-model architecture with intelligent model selection:</p>
291
+ <div class="grid md:grid-cols-2 gap-4">
292
+ <div>
293
+ <h4 class="font-semibold text-gray-800 mb-2">Primary Models:</h4>
294
+ <ul class="text-sm text-gray-600 space-y-1">
295
+ <li>β€’ <strong>Google Gemini 3</strong> - Latest research analysis and content generation</li>
296
+ <li>β€’ <strong>OpenAI GPT-4/GPT-3.5</strong> - Content synthesis and quality validation</li>
297
+ <li>β€’ <strong>Anthropic Claude 3</strong> (Sonnet, Haiku) - Alternative reasoning paths</li>
298
+ <li>β€’ <strong>ElevenLabs TTS</strong> - Multi-voice text-to-speech synthesis</li>
299
+ </ul>
300
+ </div>
301
+ <div>
302
+ <h4 class="font-semibold text-gray-800 mb-2">Capabilities:</h4>
303
+ <ul class="text-sm text-gray-600 space-y-1">
304
+ <li>β€’ Multi-paper analysis & synthesis</li>
305
+ <li>β€’ Paradigm shift detection</li>
306
+ <li>β€’ Entity extraction (genes, proteins, compounds)</li>
307
+ <li>β€’ Citation tracking & cross-references</li>
308
+ <li>β€’ Content quality scoring</li>
309
+ </ul>
310
+ </div>
311
+ </div>
312
+ </div>
313
  </div>
314
+ </div>
315
 
316
+ <!-- Research Resources -->
317
+ <div class="bg-white rounded-xl shadow-lg p-8">
318
+ <div class="flex items-start mb-4">
319
+ <span class="text-4xl mr-4">πŸ“Š</span>
320
+ <div class="flex-1">
321
+ <h3 class="text-2xl font-bold text-gray-900 mb-3">Research Resource Access</h3>
322
+ <p class="text-gray-600 mb-4">
323
+ Comprehensive academic database coverage with <strong>250+ million research papers</strong> accessible
324
+ through integrated APIs.
325
+ </p>
326
+ <div class="grid md:grid-cols-2 gap-4">
327
+ <div>
328
+ <h4 class="font-semibold text-gray-800 mb-2">Academic Databases:</h4>
329
+ <ul class="text-sm text-gray-600 space-y-1">
330
+ <li>β€’ PubMed/NCBI (~30+ million papers)</li>
331
+ <li>β€’ arXiv (~2+ million preprints)</li>
332
+ <li>β€’ NASA ADS (~15+ million papers)</li>
333
+ <li>β€’ Zenodo (100K+ datasets)</li>
334
+ <li>β€’ bioRxiv/medRxiv (preprints)</li>
335
+ <li>β€’ CORE (~200+ million papers)</li>
336
+ <li>β€’ Google Scholar (comprehensive)</li>
337
+ <li>β€’ News API (current events)</li>
338
+ <li>β€’ YouTube Data API (academic videos)</li>
339
+ </ul>
340
+ </div>
341
+ </div>
342
+ </div>
343
  </div>
344
+ </div>
345
 
346
+ <!-- Audio and Video Podcast Production -->
347
+ <div class="bg-white rounded-xl shadow-lg p-8">
348
+ <div class="flex items-start mb-4">
349
+ <span class="text-4xl mr-4">πŸŽ™οΈ</span>
350
+ <div class="flex-1">
351
+ <h3 class="text-2xl font-bold text-gray-900 mb-3">Audio and Video Podcast Production</h3>
352
+ <p class="text-gray-600 mb-4">
353
+ <strong>Operating Audio Podcast System:</strong> Full production and distribution platform for subscriber-generated
354
+ podcasts. Users can prompt, generate, publish, and distribute audio podcasts with RSS feed support for
355
+ Spotify, Apple Podcasts, and Google Podcasts.
356
+ </p>
357
+ <div class="bg-green-50 rounded-lg p-4 mb-4">
358
+ <h4 class="font-semibold text-gray-800 mb-2">Current Audio Capabilities (Operational):</h4>
359
+ <ul class="text-sm text-gray-700 space-y-1">
360
+ <li>βœ“ Multi-voice AI podcast generation</li>
361
+ <li>βœ“ Research-driven content creation</li>
362
+ <li>βœ“ RSS feed distribution</li>
363
+ <li>βœ“ Public and private podcast options</li>
364
+ <li>βœ“ Professional audio quality</li>
365
+ </ul>
366
+ </div>
367
+ <div class="bg-blue-50 rounded-lg p-4 mt-4">
368
+ <h4 class="font-semibold text-gray-800 mb-2">Video Production (Future - Phase 2+):</h4>
369
+ <p class="text-sm text-gray-700 mb-2">Advanced video features planned for future development:</p>
370
+ <ul class="text-sm text-gray-700 space-y-2">
371
+ <li>β€’ <strong>Visual Content Integration:</strong> Automated extraction from papers, web scraping, JSON database integration</li>
372
+ <li>β€’ <strong>Dynamic Visualizations:</strong> Scientific animations, real-time charts, LaTeX rendering</li>
373
+ <li>β€’ <strong>External Video Quoting:</strong> YouTube segment extraction with attribution & fair use compliance</li>
374
+ <li>β€’ <strong>Advanced Composition:</strong> Multi-layer video, auto subtitles, text overlays, professional transitions</li>
375
+ </ul>
376
+ <p class="text-xs text-gray-600 mt-2">
377
+ See: <a href="https://huggingface.co/spaces/garywelz/sciencevideodb" class="text-blue-600 hover:underline">Science Video Database</a> - Companion project for research video content management.
378
+ </p>
379
+ </div>
380
+ </div>
381
  </div>
382
  </div>
383
 
384
+ <!-- Research Papers Metadata Database -->
385
+ <div class="bg-white rounded-xl shadow-lg p-8">
386
+ <div class="flex items-start mb-4">
387
+ <span class="text-4xl mr-4">πŸ“š</span>
388
+ <div class="flex-1">
389
+ <h3 class="text-2xl font-bold text-gray-900 mb-3">Research Papers Metadata Database (Phase 2)</h3>
390
+ <p class="text-gray-600 mb-4">
391
+ A centralized <strong>metadata repository</strong> (not a file archive) providing structured JSON objects
392
+ with AI-powered preprocessing.
393
+ </p>
394
+ <div class="grid md:grid-cols-2 gap-4">
395
+ <div>
396
+ <h4 class="font-semibold text-gray-800 mb-2">Structured JSON Objects:</h4>
397
+ <ul class="text-sm text-gray-600 space-y-1">
398
+ <li>β€’ DOI, arXiv ID, publication info</li>
399
+ <li>β€’ Abstracts & key findings</li>
400
+ <li>β€’ Extracted entities (genes, proteins, compounds, equations)</li>
401
+ <li>β€’ Citation networks & cross-references</li>
402
+ <li>β€’ Paradigm shift indicators</li>
403
+ <li>β€’ Quality scores & relevance metrics</li>
404
+ </ul>
405
+ </div>
406
+ <div>
407
+ <h4 class="font-semibold text-gray-800 mb-2">AI-Powered Preprocessing:</h4>
408
+ <ul class="text-sm text-gray-600 space-y-1">
409
+ <li>β€’ LLM-based entity extraction</li>
410
+ <li>β€’ Automatic categorization</li>
411
+ <li>β€’ Keyword extraction & semantic tagging</li>
412
+ <li>β€’ Citation tracking & mapping</li>
413
+ <li>β€’ Quality assessment</li>
414
+ <li>β€’ RESTful API access</li>
415
+ </ul>
416
+ </div>
417
+ </div>
 
 
 
 
 
418
  </div>
419
  </div>
420
  </div>
421
  </div>
422
  </section>
423
 
424
+ <!-- Methodological Details -->
425
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
426
+ <div class="bg-white rounded-xl shadow-lg p-8 mb-8">
427
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">πŸ”¬ Methodology & System Design</h2>
428
+
429
+ <div class="grid md:grid-cols-2 gap-6 mb-6">
430
+ <div class="bg-blue-50 rounded-lg p-6">
431
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Multi-Source Validation Process</h3>
432
+ <p class="text-gray-700 mb-3">
433
+ The system requires a <strong>minimum of 3 research sources</strong> per podcast episode. Each source is:
434
+ </p>
435
+ <ul class="text-sm text-gray-700 space-y-2">
436
+ <li>β€’ Retrieved from authoritative academic databases (PubMed, arXiv, NASA ADS, etc.)</li>
437
+ <li>β€’ Validated for authenticity and publication status</li>
438
+ <li>β€’ Scored for quality and relevance to the research topic</li>
439
+ <li>β€’ Cross-referenced to verify consistency and eliminate conflicting information</li>
440
+ <li>β€’ Processed through parallel API queries for comprehensive coverage</li>
441
+ </ul>
442
+ </div>
443
+
444
+ <div class="bg-green-50 rounded-lg p-6">
445
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Quality Assurance Mechanisms</h3>
446
+ <ul class="text-sm text-gray-700 space-y-2">
447
+ <li>β€’ <strong>Source Verification:</strong> Automated checking of DOI, arXiv IDs, and publication metadata</li>
448
+ <li>β€’ <strong>Relevance Scoring:</strong> LLM-based assessment of paper relevance to query</li>
449
+ <li>β€’ <strong>Paradigm Shift Detection:</strong> Identification of revolutionary vs. incremental research</li>
450
+ <li>β€’ <strong>Citation Extraction:</strong> Automatic extraction and formatting of citations</li>
451
+ <li>β€’ <strong>Content Validation:</strong> Multi-model verification (Gemini, GPT-4, Claude) for accuracy</li>
452
+ </ul>
453
+ </div>
454
  </div>
455
+
456
+ <div class="bg-purple-50 rounded-lg p-6 mb-6">
457
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Citation Extraction & Verification</h3>
458
+ <p class="text-gray-700 mb-3">
459
+ The system automatically extracts and formats citations from research papers:
 
 
460
  </p>
461
+ <ul class="text-sm text-gray-700 space-y-2">
462
+ <li>β€’ DOI resolution and metadata enrichment</li>
463
+ <li>β€’ arXiv ID parsing and preprint identification</li>
464
+ <li>β€’ Author, title, and publication information extraction</li>
465
+ <li>β€’ Cross-reference linking between related papers</li>
466
+ <li>β€’ Citation network analysis for relationship mapping</li>
467
+ </ul>
468
  </div>
469
+
470
+ <div class="bg-orange-50 rounded-lg p-6">
471
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Paradigm Shift Detection Implementation</h3>
472
+ <p class="text-gray-700 mb-3">
473
+ The system uses LLM analysis to identify paradigm-shifting research by:
 
 
474
  </p>
475
+ <ul class="text-sm text-gray-700 space-y-2">
476
+ <li>β€’ Analyzing citation patterns and impact metrics</li>
477
+ <li>β€’ Detecting novel methodologies or breakthrough discoveries</li>
478
+ <li>β€’ Comparing against established knowledge frameworks</li>
479
+ <li>β€’ Identifying interdisciplinary connections and cross-domain insights</li>
480
+ <li>β€’ Flagging research that challenges existing paradigms</li>
481
+ </ul>
482
  </div>
483
  </div>
484
  </section>
485
 
486
+ <!-- Technology Stack -->
 
487
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
488
+ <div class="bg-white rounded-xl shadow-lg p-8">
489
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">βš™οΈ Technology Stack</h2>
 
 
 
490
 
491
+ <div class="grid md:grid-cols-3 gap-6 mb-6">
492
+ <div>
493
+ <h3 class="text-lg font-semibold text-gray-800 mb-3">AI & Machine Learning</h3>
494
+ <ul class="text-sm text-gray-600 space-y-1">
495
+ <li>β€’ Google Gemini 3</li>
496
+ <li>β€’ Google Vertex AI (model orchestration)</li>
497
+ <li>β€’ OpenAI GPT-4/GPT-3.5</li>
498
+ <li>β€’ Anthropic Claude 3</li>
499
+ <li>β€’ ElevenLabs TTS</li>
500
+ <li>β€’ DALL-E 3</li>
501
+ <li>β€’ Cloud Vision API</li>
502
+ <li>β€’ Video Intelligence API</li>
503
+ </ul>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
504
  </div>
505
+
506
+ <div>
507
+ <h3 class="text-lg font-semibold text-gray-800 mb-3">Backend Infrastructure</h3>
508
+ <ul class="text-sm text-gray-600 space-y-1">
509
+ <li>β€’ FastAPI (Python)</li>
510
+ <li>β€’ Google Cloud Run</li>
511
+ <li>β€’ Firestore (NoSQL)</li>
512
+ <li>β€’ Cloud Storage</li>
513
+ <li>β€’ Cloud Functions</li>
514
+ <li>β€’ Cloud Tasks</li>
515
+ <li>β€’ Secret Manager</li>
516
+ </ul>
 
 
 
 
 
517
  </div>
518
+
519
+ <div>
520
+ <h3 class="text-lg font-semibold text-gray-800 mb-3">Frontend</h3>
521
+ <ul class="text-sm text-gray-600 space-y-1">
522
+ <li>β€’ Next.js 15.5.7</li>
523
+ <li>β€’ Alpine.js</li>
524
+ <li>β€’ Tailwind CSS</li>
525
+ <li>β€’ Vercel</li>
526
+ </ul>
 
 
 
 
 
 
 
 
527
  </div>
528
  </div>
529
  </div>
530
  </section>
531
 
532
+ <!-- Limitations & Future Work -->
533
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
534
+ <div class="bg-white rounded-xl shadow-lg p-8 mb-8">
535
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">πŸ” Limitations & Future Directions</h2>
536
 
537
+ <div class="grid md:grid-cols-2 gap-6">
538
+ <div class="bg-yellow-50 rounded-lg p-6 border-l-4 border-yellow-400">
539
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Current Limitations</h3>
540
+ <ul class="text-sm text-gray-700 space-y-2">
541
+ <li>β€’ <strong>Discipline Coverage:</strong> Mathematics currently has the most complete indexing (12,000+ papers); other disciplines are being expanded</li>
542
+ <li>β€’ <strong>Source Bias:</strong> Coverage depends on database API availability and open access policies</li>
543
+ <li>β€’ <strong>LLM Accuracy:</strong> Content generation relies on LLM accuracy; multi-source validation mitigates but doesn't eliminate errors</li>
544
+ <li>β€’ <strong>Real-Time Updates:</strong> Knowledge graph updates require manual or scheduled processing cycles</li>
545
+ <li>β€’ <strong>Language:</strong> Currently optimized for English-language research papers</li>
546
  </ul>
547
  </div>
548
+
549
+ <div class="bg-blue-50 rounded-lg p-6 border-l-4 border-blue-400">
550
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Future Development</h3>
551
+ <ul class="text-sm text-gray-700 space-y-2">
552
+ <li>β€’ <strong>Multi-Discipline Expansion:</strong> Expanding knowledge graph to Biology, Chemistry, Physics, Computer Science</li>
553
+ <li>β€’ <strong>Process Databases:</strong> Creating comprehensive flowchart databases for all 5 disciplines (~50 processes each)</li>
554
+ <li>β€’ <strong>Advanced Video Features:</strong> Dynamic visualizations, animations, and multi-layer composition</li>
555
+ <li>β€’ <strong>Multi-Language Support:</strong> Extending to non-English research papers</li>
556
+ <li>β€’ <strong>Enhanced Validation:</strong> Peer review mechanisms and user feedback integration</li>
557
+ <li>β€’ <strong>Real-Time Updates:</strong> Automated continuous knowledge graph updates</li>
558
  </ul>
559
  </div>
560
+ </div>
561
+ </div>
562
+ </section>
563
 
564
+ <!-- Research & Collaborative Tools -->
565
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
566
+ <div class="bg-gradient-to-r from-green-50 to-blue-50 rounded-xl p-8">
567
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">πŸ”¬ Collaborative Research Tools</h2>
568
+
569
+ <div class="grid md:grid-cols-2 gap-6 mb-6">
570
  <div>
571
+ <h3 class="text-xl font-semibold text-gray-800 mb-3">Collaborative Research Tools</h3>
572
+ <p class="text-gray-700 mb-3">
573
+ These platforms enable collective participation and collaboration across diverse user communities:
574
+ </p>
575
+ <ul class="text-gray-700 space-y-2">
576
+ <li>β€’ <strong>Researchers</strong> - Tools for hypothesis formation and testing, cross-disciplinary synthesis</li>
577
+ <li>β€’ <strong>Collaborators</strong> - Collective knowledge exploration and refinement</li>
578
+ <li>β€’ <strong>Subscribers</strong> - Prompt, generate, and share podcasts (public or private)</li>
579
+ <li>β€’ <strong>Community</strong> - User suggestions, comments, and collaborative flowchart improvement (GLMP)</li>
580
  </ul>
581
+ <p class="text-gray-600 mt-4 italic">
582
+ Like a microscope enables observation of the microscopic world, these tools enable observation and
583
+ exploration of humanity's collective knowledge.
584
+ </p>
585
  </div>
586
+
587
  <div>
588
+ <h3 class="text-xl font-semibold text-gray-800 mb-3">Key Innovations</h3>
589
+ <ul class="text-gray-700 space-y-2">
590
+ <li>β€’ Multi-source validation (min 3 sources)</li>
591
+ <li>β€’ Evidence-based generation</li>
592
+ <li>β€’ Paradigm shift detection</li>
593
+ <li>β€’ Interdisciplinary connections</li>
594
+ <li>β€’ Multiple expertise levels</li>
595
+ <li>β€’ Full citation tracking</li>
596
  </ul>
597
  </div>
598
  </div>
599
  </div>
600
  </section>
601
 
602
+ <!-- Prior Work & Research Contributions -->
603
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
604
+ <div class="bg-gradient-to-r from-purple-50 to-blue-50 rounded-xl shadow-lg p-8">
605
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">πŸ“š Prior Work & Research Contributions</h2>
606
+
607
+ <div class="bg-white rounded-lg p-6 mb-6">
608
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Overview</h3>
609
+ <p class="text-gray-700 mb-4">
610
+ This platform represents <strong>prior work</strong> that demonstrates foundational research and development
611
+ achievements in AI-powered scientific knowledge synthesis, collaborative research tools, and multi-modal content
612
+ generation. These contributions establish the technical foundation and proof-of-concept for the broader
613
+ <strong>CopernicusAI Knowledge Engine</strong> initiative.
614
  </p>
 
 
 
 
 
615
  </div>
616
 
617
+ <div class="grid md:grid-cols-2 gap-6 mb-6">
618
+ <div class="bg-white rounded-lg p-6">
619
+ <h3 class="text-lg font-semibold text-gray-900 mb-3">πŸ”¬ Research Contributions</h3>
620
+ <ul class="text-sm text-gray-700 space-y-2">
621
+ <li>β€’ <strong>AI-Powered Research Synthesis:</strong> Production system for multi-source research synthesis using LLMs</li>
622
+ <li>β€’ <strong>Multi-Model Architecture:</strong> Intelligent model selection with Gemini 3, GPT-4, Claude 3</li>
623
+ <li>β€’ <strong>Collaborative Platform:</strong> Subscriber-driven content generation with public/private sharing</li>
624
+ <li>β€’ <strong>Knowledge Engine Integration:</strong> Architecture for Research Papers DB, Video DB, GLMP, Framework</li>
625
+ </ul>
626
+ </div>
627
+
628
+ <div class="bg-white rounded-lg p-6">
629
+ <h3 class="text-lg font-semibold text-gray-900 mb-3">βš™οΈ Technical Achievements</h3>
630
+ <ul class="text-sm text-gray-700 space-y-2">
631
+ <li>β€’ <strong>250+ Million Papers:</strong> Accessible via 8+ integrated academic databases</li>
632
+ <li>β€’ <strong>64+ Episodes:</strong> Generated across 5 scientific disciplines</li>
633
+ <li>β€’ <strong>Production Deployment:</strong> Live platform with operational API and RSS distribution</li>
634
+ <li>β€’ <strong>Scalable Architecture:</strong> Serverless microservices on Google Cloud</li>
635
+ </ul>
636
+ </div>
637
+ </div>
638
+
639
+ <div class="bg-white rounded-lg p-6 mb-6">
640
+ <h3 class="text-lg font-semibold text-gray-900 mb-3">🎯 Position Within CopernicusAI Knowledge Engine</h3>
641
+ <p class="text-gray-700 mb-3">
642
+ This platform serves as the <strong>core synthesis and distribution component</strong> of the CopernicusAI Knowledge Engine.
643
+ The Knowledge Engine is an integrated ecosystem of research and collaboration tools that work together to assist scientists
644
+ in their workflow, from research discovery through knowledge synthesis to multi-format content generation.
645
  </p>
646
+ <div class="bg-blue-50 rounded-lg p-4 mb-3">
647
+ <h4 class="font-semibold text-gray-900 mb-2">Current Components:</h4>
648
+ <div class="grid md:grid-cols-2 gap-4 text-sm">
649
+ <ul class="text-gray-700 space-y-1">
650
+ <li>1. <strong>CopernicusAI</strong> (This platform) - Core synthesis & distribution</li>
651
+ <li>2. <strong>Programming Framework</strong> - Foundational meta-tool</li>
652
+ <li>3. <strong>GLMP</strong> - Biological process visualization</li>
653
+ </ul>
654
+ <ul class="text-gray-700 space-y-1">
655
+ <li>4. <strong>Research Paper Metadata Database</strong> - Data infrastructure</li>
656
+ <li>5. <strong>Science Video Database</strong> - Multi-modal content</li>
657
+ </ul>
658
+ </div>
659
+ </div>
660
+ <div class="bg-purple-50 rounded-lg p-4">
661
+ <h4 class="font-semibold text-gray-900 mb-2">Future Development:</h4>
662
+ <p class="text-gray-700 text-sm">
663
+ The Knowledge Engine is designed to grow and evolve. Additional tools, databases, and collaboration components
664
+ will be added as the project develops, expanding capabilities for AI-assisted scientific research and knowledge discovery.
665
+ </p>
666
+ </div>
667
+ </div>
668
+
669
+ <div class="bg-blue-50 rounded-lg p-6">
670
+ <h3 class="text-lg font-semibold text-gray-900 mb-3">πŸ“– Citation Information</h3>
671
+ <p class="text-sm text-gray-700 mb-3">
672
+ <strong>For Grant Proposals (NSF/DOE):</strong>
673
+ </p>
674
+ <div class="bg-white rounded p-4 font-mono text-sm text-gray-800 mb-4">
675
+ <p class="mb-2">Welz, G. (2025). CopernicusAI: Knowledge Engine for Scientific Discovery.</p>
676
+ <p class="mb-2">Hugging Face Space. https://huggingface.co/spaces/garywelz/copernicusai</p>
677
+ <p>Live Platform: https://www.copernicusai.fyi</p>
678
+ </div>
679
+ <div class="bg-white rounded p-4 mb-4">
680
+ <p class="text-sm font-semibold text-gray-700 mb-2">BibTeX Format:</p>
681
+ <pre class="bg-gray-900 text-green-400 p-3 rounded text-xs overflow-x-auto"><code>@misc{welz2025copernicusai,
682
+ title={CopernicusAI: Knowledge Engine for Scientific Discovery},
683
+ author={Welz, Gary},
684
+ year={2025},
685
+ url={https://huggingface.co/spaces/garywelz/copernicusai},
686
+ note={Hugging Face Space, Live Platform: https://www.copernicusai.fyi}
687
+ }</code></pre>
688
+ </div>
689
+ </div>
690
+ </div>
691
+ </section>
692
+
693
+ <!-- Data Availability Statement -->
694
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
695
+ <div class="bg-white rounded-xl shadow-lg p-8 mb-8">
696
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">πŸ“Š Data Availability Statement</h2>
697
+
698
+ <div class="bg-blue-50 rounded-lg p-6 mb-4">
699
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Platform Access</h3>
700
+ <ul class="text-gray-700 space-y-2">
701
+ <li>β€’ <strong>Live Platform:</strong> <a href="https://www.copernicusai.fyi" target="_blank" class="text-blue-600 hover:underline">https://www.copernicusai.fyi</a> (opens in new tab)</li>
702
+ <li>β€’ <strong>Knowledge Engine Dashboard:</strong> <a href="https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine" target="_blank" class="text-blue-600 hover:underline">https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine</a> (opens in new tab)</li>
703
+ <li>β€’ <strong>API Base URL:</strong> <code class="bg-gray-100 px-2 py-1 rounded">https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app</code></li>
704
+ <li>β€’ <strong>RSS Feed:</strong> <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/feeds/copernicus-mvp-rss-feed.xml" target="_blank" class="text-blue-600 hover:underline">Available for public access</a> (opens in new tab)</li>
705
+ </ul>
706
+ </div>
707
+
708
+ <div class="bg-green-50 rounded-lg p-6 mb-4">
709
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Data & Code Availability</h3>
710
+ <ul class="text-gray-700 space-y-2">
711
+ <li>β€’ <strong>Hugging Face Spaces:</strong> All components accessible at <a href="https://huggingface.co/garywelz" target="_blank" class="text-blue-600 hover:underline">https://huggingface.co/garywelz</a> (opens in new tab)</li>
712
+ <li>β€’ <strong>Process Flowcharts (GLMP):</strong> JSON files stored in Google Cloud Storage, accessible via <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/glmp-database-table.html" target="_blank" class="text-blue-600 hover:underline">GLMP Database Table</a> (opens in new tab)</li>
713
+ <li>β€’ <strong>Research Paper Metadata:</strong> 12,000+ indexed papers with metadata accessible through Knowledge Engine Dashboard</li>
714
+ <li>β€’ <strong>API Documentation:</strong> RESTful API endpoints available for programmatic access (see API Documentation section)</li>
715
+ </ul>
716
+ </div>
717
+
718
+ <div class="bg-purple-50 rounded-lg p-6">
719
+ <h3 class="text-xl font-semibold text-gray-900 mb-4">Reproducibility Information</h3>
720
+ <ul class="text-gray-700 space-y-2">
721
+ <li>β€’ <strong>Technology Stack:</strong> All technologies and versions documented in Technology Stack section</li>
722
+ <li>β€’ <strong>LLM Models:</strong> Google Gemini 3, OpenAI GPT-4/GPT-3.5, Anthropic Claude 3 (versions specified in documentation)</li>
723
+ <li>β€’ <strong>Source Citations:</strong> All podcast episodes include full citations to source papers</li>
724
+ <li>β€’ <strong>Metadata:</strong> Complete metadata for all generated content available through API</li>
725
+ <li>β€’ <strong>License:</strong> MIT License - see license information in space metadata</li>
726
+ </ul>
727
  </div>
728
  </div>
729
  </section>
 
732
  <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
733
  <div class="bg-white rounded-xl shadow-lg p-8">
734
  <h2 class="text-3xl font-bold text-gray-900 mb-6">How to Cite This Work</h2>
735
+ <div class="bg-gray-50 rounded-lg p-6 mb-4">
736
+ <p class="text-gray-800 font-mono text-lg leading-relaxed mb-4">
737
+ Welz, G. (2024–2025). <em>CopernicusAI: AI-Generated Audio Briefings as a Research Interface</em>.<br>
738
+ Hugging Face Spaces. https://huggingface.co/spaces/garywelz/copernicusai
739
+ </p>
740
+
741
+ <div class="border-t border-gray-300 pt-4 mt-4">
742
+ <p class="text-sm font-semibold text-gray-700 mb-2">BibTeX Format:</p>
743
+ <pre class="bg-gray-800 text-green-400 p-4 rounded text-sm overflow-x-auto"><code>@misc{welz2025copernicusai,
744
+ title={CopernicusAI: AI-Generated Audio Briefings as a Research Interface},
745
  author={Welz, Gary},
746
  year={2024--2025},
747
+ url={https://huggingface.co/spaces/garywelz/copernicusai},
748
+ note={Hugging Face Space}
749
  }</code></pre>
750
+ </div>
751
+ </div>
752
+ </div>
753
+ </section>
754
+
755
+ <!-- Grant Support & Collaboration -->
756
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
757
+ <div class="bg-white rounded-xl shadow-lg p-8">
758
+ <h2 class="text-3xl font-bold text-gray-900 mb-6">🌐 Grant Support & Collaboration</h2>
759
+
760
+ <div class="mb-6">
761
+ <h3 class="text-xl font-semibold text-gray-800 mb-3">Grant Applications Supported</h3>
762
+ <p class="text-gray-700 mb-4">
763
+ This platform is designed to support grant applications to:
764
+ </p>
765
+ <div class="grid md:grid-cols-3 gap-4">
766
+ <div class="bg-blue-50 rounded-lg p-4">
767
+ <h4 class="font-semibold text-gray-800 mb-2">NSF</h4>
768
+ <p class="text-sm text-gray-600">National Science Foundation - Science education and research infrastructure</p>
769
+ </div>
770
+ <div class="bg-green-50 rounded-lg p-4">
771
+ <h4 class="font-semibold text-gray-800 mb-2">DOE</h4>
772
+ <p class="text-sm text-gray-600">Department of Energy - Scientific computing and data science</p>
773
+ </div>
774
+ <div class="bg-purple-50 rounded-lg p-4">
775
+ <h4 class="font-semibold text-gray-800 mb-2">SAIR Foundation</h4>
776
+ <p class="text-sm text-gray-600">AI research and development initiatives</p>
777
  </div>
778
  </div>
779
+ </div>
780
+
781
+ <div>
782
+ <h3 class="text-xl font-semibold text-gray-800 mb-3">Collaboration Opportunities</h3>
783
+ <ul class="text-gray-700 space-y-2">
784
+ <li>β€’ Integration with academic institutions</li>
785
+ <li>β€’ Partnership with research organizations</li>
786
+ <li>β€’ Open data initiatives</li>
787
+ <li>β€’ Educational program development</li>
788
+ </ul>
789
+ </div>
790
+ </div>
791
+ </section>
792
+
793
+ <!-- Links & Resources -->
794
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-12">
795
+ <div class="bg-gradient-to-r from-blue-50 to-purple-50 rounded-xl p-8">
796
+ <h2 class="text-3xl font-bold text-gray-900 mb-6 text-center">πŸ”— Live Platform & Resources</h2>
797
+
798
+ <div class="grid md:grid-cols-2 gap-6">
799
+ <div class="bg-white rounded-lg p-6">
800
+ <h3 class="text-xl font-semibold text-gray-800 mb-4">🌐 Production Deployment</h3>
801
+ <ul class="space-y-2">
802
+ <li>
803
+ <a href="https://www.copernicusai.fyi" target="_blank" rel="noopener noreferrer"
804
+ class="text-blue-600 hover:text-blue-800 font-medium">
805
+ 🏠 Homepage - Browse Podcasts (opens in new tab)
806
+ </a>
807
+ </li>
808
+ <li>
809
+ <a href="https://www.copernicusai.fyi/subscriber-dashboard.html" target="_blank" rel="noopener noreferrer"
810
+ class="text-blue-600 hover:text-blue-800 font-medium">
811
+ πŸ“Š Creator Dashboard (opens in new tab)
812
+ </a>
813
+ </li>
814
+ <li>
815
+ <a href="https://storage.googleapis.com/regal-scholar-453620-r7-podcast-storage/feeds/copernicus-mvp-rss-feed.xml" target="_blank" rel="noopener noreferrer"
816
+ class="text-blue-600 hover:text-blue-800 font-medium">
817
+ πŸ“‘ RSS Feed (opens in new tab)
818
+ </a>
819
+ </li>
820
+ </ul>
821
+ </div>
822
+
823
+ <div class="bg-white rounded-lg p-6">
824
+ <h3 class="text-xl font-semibold text-gray-800 mb-4">🧩 Knowledge Engine Components</h3>
825
+ <p class="text-sm text-gray-600 mb-4">
826
+ The CopernicusAI Knowledge Engine is an integrated ecosystem of research and collaboration tools.
827
+ The <strong>Knowledge Engine Dashboard is now fully operational</strong> (December 2025) with a working web interface providing unified access to all components.
828
  </p>
829
+ <div class="bg-green-50 rounded-lg p-4 mb-4">
830
+ <h4 class="font-semibold text-gray-800 mb-2">βœ… Knowledge Engine Dashboard (Implemented)</h4>
831
+ <p class="text-sm text-gray-700 mb-2">
832
+ Fully operational web interface with knowledge graph visualization (12,000+ papers), vector search, RAG queries, and content browsing.
833
+ </p>
834
+ <a href="https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine"
835
+ target="_blank" rel="noopener noreferrer"
836
+ class="text-blue-600 hover:underline text-sm font-medium">
837
+ Live System: https://copernicus-frontend-phzp4ie2sq-uc.a.run.app/knowledge-engine β†’ (opens in new tab)
838
+ </a>
839
+ </div>
840
+ <ul class="space-y-3">
841
+ <li>
842
+ <a href="https://huggingface.co/spaces/garywelz/programming_framework" target="_blank" rel="noopener noreferrer"
843
+ class="text-blue-600 hover:text-blue-800 font-medium">
844
+ πŸ› οΈ Programming Framework (opens in new tab)
845
+ </a>
846
+ <p class="text-sm text-gray-600 mt-1 ml-6">
847
+ Foundational meta-tool for universal process analysis across any discipline
848
+ </p>
849
+ </li>
850
+ <li>
851
+ <a href="https://huggingface.co/spaces/garywelz/glmp" target="_blank" rel="noopener noreferrer"
852
+ class="text-blue-600 hover:text-blue-800 font-medium">
853
+ 🧬 GLMP - Genome Logic Modeling Project (opens in new tab)
854
+ </a>
855
+ <p class="text-sm text-gray-600 mt-1 ml-6">
856
+ First application of Programming Framework to biology - 50+ biological processes visualized
857
+ </p>
858
+ </li>
859
+ <li>
860
+ <a href="https://huggingface.co/spaces/garywelz/metadata_database" target="_blank" rel="noopener noreferrer"
861
+ class="text-blue-600 hover:text-blue-800 font-medium">
862
+ πŸ“š Research Paper Metadata Database (opens in new tab)
863
+ </a>
864
+ <p class="text-sm text-gray-600 mt-1 ml-6">
865
+ Core data infrastructure for structured research paper metadata and citation networks
866
+ </p>
867
+ </li>
868
+ <li>
869
+ <a href="https://huggingface.co/spaces/garywelz/sciencevideodb" target="_blank" rel="noopener noreferrer"
870
+ class="text-blue-600 hover:text-blue-800 font-medium">
871
+ 🎬 Science Video Database (opens in new tab)
872
+ </a>
873
+ <p class="text-sm text-gray-600 mt-1 ml-6">
874
+ Multi-modal content component with transcript-based search for scientific videos
875
+ </p>
876
+ </li>
877
+ </ul>
878
  </div>
879
  </div>
880
+ </div>
881
+ </section>
882
+
883
+ <!-- API Endpoints -->
884
+ <section class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 py-8">
885
+ <div class="bg-gray-900 text-white rounded-xl p-8">
886
+ <h2 class="text-3xl font-bold mb-6">πŸ”Œ API Documentation</h2>
887
+ <p class="text-gray-300 mb-6">Base URL: <code class="bg-gray-800 px-2 py-1 rounded">https://copernicus-podcast-api-phzp4ie2sq-uc.a.run.app</code></p>
888
+
889
+ <div class="grid md:grid-cols-3 gap-4 text-sm">
890
+ <div>
891
+ <h4 class="font-semibold text-blue-300 mb-2">Podcast Generation</h4>
892
+ <ul class="space-y-1 text-gray-400">
893
+ <li>POST /generate-podcast-with-subscriber</li>
894
+ <li>GET /api/subscribers/podcasts/{id}</li>
895
+ <li>POST /api/subscribers/podcasts/submit-to-rss</li>
896
+ </ul>
897
+ </div>
898
+
899
+ <div>
900
+ <h4 class="font-semibold text-blue-300 mb-2">Research Endpoints</h4>
901
+ <ul class="space-y-1 text-gray-400">
902
+ <li>POST /api/papers/upload</li>
903
+ <li>GET /api/papers/{paper_id}</li>
904
+ <li>POST /api/papers/query</li>
905
+ <li>POST /api/papers/{id}/link-podcast/{id}</li>
906
+ </ul>
907
+ </div>
908
+
909
+ <div>
910
+ <h4 class="font-semibold text-blue-300 mb-2">Admin Endpoints</h4>
911
+ <ul class="space-y-1 text-gray-400">
912
+ <li>GET /api/admin/subscribers</li>
913
+ <li>POST /api/admin/podcasts/fix-missing-titles</li>
914
+ <li>GET /api/admin/podcasts/catalog</li>
915
+ </ul>
916
+ </div>
917
  </div>
918
  </div>
919
  </section>
 
921
  <!-- Footer -->
922
  <footer class="gradient-bg text-white py-8 mt-12">
923
  <div class="max-w-7xl mx-auto px-4 sm:px-6 lg:px-8 text-center">
924
+ <p class="text-lg font-semibold mb-2">CopernicusAI - Advancing Scientific Knowledge</p>
925
+ <p class="text-sm opacity-75">Built with Google Cloud, Gemini AI, OpenAI, Anthropic Claude, and ElevenLabs</p>
926
+ <p class="text-xs opacity-50 mt-4">&copy; 2025 CopernicusAI. All rights reserved.</p>
927
  </div>
928
  </footer>
 
 
 
 
929
  </body>
930
  </html>