Endpoints
Base URL: https://sn1.api.macrocosmos.ai
https://sn1.api.macrocosmos.aiMain endpoint that handles both regular, multi step reasoning, test time inference, and mixture of miners chat completion.
Request model for the /v1/chat/completions endpoint.
List of specific miner UIDs to query. If not provided, miners will be selected automatically.
[1,2,3]Random seed for reproducible results. If not provided, a random seed will be generated.
42Task identifier to choose the inference type.
InferenceTaskExample: InferenceTaskModel identifier to filter available miners.
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4Enable step-by-step reasoning mode that shows the model's thinking process.
falseEnable mixture of miners mode that combines responses from multiple miners.
falseParameters to control text generation, such as temperature, top_p, etc.
{"temperature":0.7,"top_p":0.95,"top_k":50,"max_new_tokens":1024,"do_sample":true}Example: {"do_sample":true,"max_new_tokens":512,"temperature":0.7,"top_k":50,"top_p":0.95}Inference mode to use for the task.
Reasoning-FastEnable JSON format for the response.
falseExample: trueEnable streaming for the response.
falseExample: trueSuccessful response with streaming text
Validation Error
Internal server error or no available miners
Retrieves information from the web based on a search query using multiple miners.
Request model for the /web_retrieval endpoint.
List of specific miner UIDs to query. If not provided, miners will be selected automatically.
[1,2,3]The query to search for on the web.
latest advancements in quantum computingNumber of miners to query for results.
3Example: 15Maximum number of results to return in the response.
1Example: 5Maximum time to wait for responses in seconds.
10Example: 15Successful response with web search results
Response model for the /web_retrieval endpoint.
Validation Error
Internal server error, no available miners, or no successful miner responses
Last updated
