Atiendia
✅ Evidence-based validation

Novelty Validation
The System Doesn't Hallucinate

Generating ideas is easy. Determining if they are new and valuable is the real challenge. The system includes an automated audit module that acts as a preliminary "Peer Reviewer".

⚠️

The Problem of "Novel Ideas"

Generative AI systems can produce hypotheses that sound plausible but have actually already been explored in the literature. Without external validation, you risk:

  • × Reinventing the wheel: Wasting resources on research already done
  • × False novelties: Presenting something as original that already exists
  • × Wasted time: Researchers following leads that already have answers

The Solution: Truth-Checking with Global Databases

Every idea is validated against global scientific literature in real-time

📚

Semantic Scholar

AI2 (Allen Institute)

✓ 200M+ papers indexed
✓ Complete Citation Graph (who cites whom)
✓ Semantic API: search by concepts, not just keywords
✓ Automatic TL;DR generated by AI
✓ Multidisciplinary coverage (CS, medicine, physics, etc.)
🌍

OpenAlex

OurResearch (ex Microsoft Academic)

✓ 250M+ works cataloged
✓ 100% Open Access (free and complete API)
✓ Global coverage: papers in all languages
✓ Rich metadata: authors, institutions, concepts
✓ Continuous updates (new papers daily)
🔍 More than 400 million papers queried automatically

Evidence-Based Validation Process

Three steps for each generated idea

1

🔍 Evidence Retrieval

The system builds semantic queries from the proposed idea and actively searches Semantic Scholar and OpenAlex for papers that have already explored that specific combination of concepts.

Example query:

"transfer learning computer vision natural language processing cross-domain application"
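The retrieval step can be sketched as follows. The two endpoint paths are the public search endpoints of the Semantic Scholar Graph API and the OpenAlex API; the field list, result limit, and the `build_search_urls` helper are illustrative assumptions, not the system's actual code:

```python
from urllib.parse import urlencode

S2_SEARCH = "https://api.semanticscholar.org/graph/v1/paper/search"
OPENALEX_SEARCH = "https://api.openalex.org/works"

def build_search_urls(idea_concepts, limit=20):
    """Build one search URL per truth source from the idea's key concepts."""
    query = " ".join(idea_concepts)
    return {
        "semantic_scholar": S2_SEARCH + "?" + urlencode(
            {"query": query, "fields": "title,abstract,year,citationCount",
             "limit": limit}),
        "openalex": OPENALEX_SEARCH + "?" + urlencode(
            {"search": query, "per-page": limit}),
    }

urls = build_search_urls(["transfer learning", "computer vision",
                          "natural language processing",
                          "cross-domain application"])
# Each URL can then be fetched, e.g. with requests.get(url).json()
```
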

2

🤖 Automated Judgment

A specialized evaluator model (LLM with review prompt) analyzes:

  • → The proposed idea vs the abstracts/conclusions of found papers
  • → If the similarity is superficial (keywords) or substantial (same hypothesis)
  • → Nuances that could differentiate the new idea from existing ones
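A minimal sketch of how such a review prompt might be assembled. The `build_judge_prompt` helper and its wording are hypothetical, not the system's actual prompt:

```python
def build_judge_prompt(idea, evidence):
    """Assemble the evaluator LLM's review prompt (wording is illustrative)."""
    # Number each retrieved paper so the judge can cite it in its rationale
    refs = "\n".join(
        f"[{i + 1}] {p['title']} ({p['year']}): {p['abstract'][:300]}"
        for i, p in enumerate(evidence)
    )
    return (
        "You are a scientific peer reviewer. Compare the proposed idea with "
        "the evidence below. Decide whether any paper explores the SAME "
        "hypothesis (not merely shared keywords), and note nuances that "
        "differentiate the idea. Answer NOVEL, KNOWN, or EMERGING.\n\n"
        f"Idea: {idea}\n\nEvidence:\n{refs}"
    )
```
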
3

📊 State Classification

Based on the recovered evidence, the idea is classified into one of three states, or flagged as UNCERTAIN when the evidence is inconclusive:

Novelty Validation States

Automatic classification of novelty level based on scientific evidence

🟢

NOVEL_BRIDGE

Genuinely New

No significant evidence was recovered connecting the key concepts of the proposed idea. It is a genuine research gap with high probability of being unpublished.

Metrics:

  • novelty_score > 0.8
  • Exhaustive search without relevant results
  • "Blue Ocean" opportunity

🚀 Recommended action:

✓ High priority for research
✓ Proceed with formal hypothesis development
✓ Design preliminary experiments
✓ Secure resources and funding

🟡

EMERGING_LINK

Recent Trend

Scattered or very recent evidence found. The topic is just beginning to be explored: there are recent papers (from the last 1-2 years) but no consensus or standard solution yet.

Metrics:

  • novelty_score 0.4 - 0.7
  • Found < 5 related papers
  • Publication dates clustered in recent years

⚠️ Recommended action:

✓ Review the found papers in depth
✓ Differentiate the proposal clearly from them
✓ Good for incremental publication (State of the Art + Delta)

🔴

KNOWN_LINK

Consolidated

The connection is well established in prior literature. The system provides existing references (prior_art_refs) for consultation.

Metrics:

  • knownness_score > 0.75
  • Pre-existing consolidated literature
  • Marked as redundant

🔄 Recommended action:

× Discard or pivot the original idea
✓ Read existing papers (learning)
✓ Look for a completely unexplored angle
✓ Consider novel extensions or variations

⚪

UNCERTAIN

Ambiguous

Insufficient or contradictory evidence. The LLM evaluator could not confidently determine if the idea is new or existing.

Possible causes:

  • Concepts in emerging interdisciplinary areas
  • Ambiguous or variable terminology
  • Limited coverage in databases

🔍 Recommended action:

⚠ Requires expert human eye
⚠ Review recovered evidence manually
⚠ Consult with domain expert
⚠ Refine the idea formulation and re-validate

External Judge Decisions

Specialized evaluator model (e.g., GPT-4o mini) that analyzes abstracts

✨

NOVEL

Judge Verdict

The specific combination of concepts does not appear in the recovered evidence. The analyzed abstracts show no direct interaction between the proposed ideas.

📚

KNOWN

Judge Verdict

The concepts appear directly interacting in the evidence. Multiple papers demonstrate that the connection has already been explored or implemented.

🌱

EMERGING

Judge Verdict

The interaction appears only in recent literature. The topic is young and is in active exploration phase by the scientific community.

Scoring Metrics

N

novelty_score

Range: 0.0 - 1.0

Estimated probability that the idea is unpublished. Calculated from:

  • Absence of papers with high semantic similarity
  • External judge verdict (high weight)
  • Quantity and quality of recovered evidence

Typical threshold: novelty_score > 0.8 β†’ NOVEL_BRIDGE classification

K

knownness_score

Range: 0.0 - 1.0

Degree of certainty that the idea already exists in the literature. Conceptual inverse of novelty_score. Signals considered:

  • Presence of multiple highly relevant papers
  • Cross-citations between recovered papers
  • Age of related publications

Typical threshold: knownness_score > 0.75 β†’ KNOWN_LINK classification
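As a sketch, the novelty_score signals listed above could be combined linearly. The verdict mapping and the 0.6/0.3/0.1 weights are illustrative assumptions (the page only says the judge verdict carries high weight):

```python
def novelty_score(judge_verdict, max_similarity, evidence_count):
    """Combine the three novelty signals into a single 0-1 score (weights assumed)."""
    verdict_signal = {"NOVEL": 1.0, "EMERGING": 0.5, "KNOWN": 0.0}[judge_verdict]
    similarity_signal = 1.0 - max_similarity        # no near-duplicate abstracts found
    evidence_signal = 1.0 / (1.0 + evidence_count)  # fewer relevant hits -> more novel
    return 0.6 * verdict_signal + 0.3 * similarity_signal + 0.1 * evidence_signal
```
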

📋

Evidence Requirements

Each classification must be backed by evidence to ensure system reliability:

Minimum references

min_evidence_refs = 2 (default)

At least 2 recovered documents are required to make a reliable classification.

Truth sources

• OpenAlex (250M+ works)
• Semantic Scholar (200M+ papers)
• Real-time query

Quantifiable Value

The real impact of automated validation

~70%

Of unvalidated AI-generated ideas turn out to be redundant or already published

10-20x

Faster than exhaustive manual literature review

$$$

Saved in research resources by avoiding duplicated effort

Technical Details

How it works under the hood

🔍

Query Generation

From the proposed idea, the system generates multiple optimized queries:

  • Main query: Key concepts of the idea
  • Alternative queries: Synonyms and paraphrases
  • Semantic expansion: Related concepts
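A minimal sketch of the query-expansion step; the `expand_queries` helper and its synonym table are hypothetical:

```python
def expand_queries(concepts, synonyms):
    """Build the main query plus alternatives by substituting synonyms.

    `synonyms` maps a concept to alternative phrasings (illustrative data).
    """
    main = " ".join(concepts)
    alternatives = []
    for concept, alts in synonyms.items():
        for alt in alts:
            alternatives.append(main.replace(concept, alt))
    return [main] + alternatives

qs = expand_queries(
    ["transfer learning", "cross-domain"],
    {"transfer learning": ["knowledge transfer", "domain adaptation"]})
# qs[0] is the main query; qs[1:] are synonym variants sent to the same APIs
```
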
📊

Scoring & Ranking

Retrieved papers are ranked by relevance and top-K is analyzed:

  • Semantic similarity: Idea vs abstract embeddings
  • Citation count: Weighted by paper impact
  • Recency: More recent papers have higher weight
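The ranking step might look like the sketch below; the 0.6/0.25/0.15 weights and the decay constants are illustrative assumptions, not the product's actual tuning:

```python
import math
from datetime import date

def rank_papers(papers, top_k=5, this_year=None):
    """Rank retrieved papers by a weighted relevance score, keep the top-K."""
    this_year = this_year or date.today().year
    def score(p):
        recency = math.exp(-(this_year - p["year"]) / 5)  # favour recent work
        impact = math.log1p(p["citations"]) / 10          # dampen huge counts
        return 0.6 * p["similarity"] + 0.25 * recency + 0.15 * impact
    return sorted(papers, key=score, reverse=True)[:top_k]
```
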
🤖

LLM Evaluator

Model specialized in critical idea review:

  • Chain-of-Thought: Explicit step-by-step reasoning
  • Few-shot examples: Curated evaluation examples
  • Structured output: JSON with state + rationale
📝

Output Enrichment

Each result includes actionable metadata:

  • State: NOVEL_BRIDGE / EMERGING_LINK / KNOWN_LINK / UNCERTAIN
  • Confidence score: 0-1 certainty level
  • References: Direct links to related papers
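For illustration, an enriched result could look like the JSON below. `state` and `prior_art_refs` appear on this page; `confidence` and `rationale` are assumed field names standing in for the confidence score and the judge's reasoning:

```python
import json

# Example of the structured output attached to each validated idea (schema assumed)
raw = """{
  "state": "NOVEL_BRIDGE",
  "confidence": 0.87,
  "rationale": "No retrieved abstract combines the two key concepts directly.",
  "prior_art_refs": []
}"""
result = json.loads(raw)
assert result["state"] in {"NOVEL_BRIDGE", "EMERGING_LINK", "KNOWN_LINK", "UNCERTAIN"}
```
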

Validate the Novelty of Your Ideas Before Investing Resources

Don't let months of research end with a "this was already done in 2019". Validate automatically before you start.