Corpus Generation Pipeline

Follow these stages to create your research corpus

1
Web Source Compilation
Generate and refine search queries
2
Data Extraction
Collect and filter web content
3
Data Cleaning
Process and prepare final corpus
Step 1: Corpus Description

Your task: Describe the content you want to collect in detail.

How it works: Minerva will leverage LLMs to gain an understanding of your corpus subject.