Cohere AI is an enterprise provider of high-performance large language models (LLMs) for contexts such as content generation, summarization, and search. The company was founded in 2019 by former Google Brain employees Aidan N. Gomez and Nick Frosst together with Canadian researcher and entrepreneur Ivan Zhang.
Cohere AI Business Model Explained
Scale AI Business Model
Foundational Layer
Data Labeling and Annotation:
Providing high-quality data labeling and annotation services using AI technologies.
Machine Learning Models:
Training machine learning models with labeled data to improve accuracy and performance.
Human-in-the-Loop:
Combining AI algorithms with human expertise to achieve precise data labeling and annotation.
Quality Control:
Implementing rigorous quality control measures to ensure accurate and reliable annotations.
Value Layer
High-Quality Data Annotations:
Providing accurate and reliable annotations for training machine learning models.
Scalability and Efficiency:
Scaling data labeling operations efficiently to handle large volumes of data.
Domain Expertise:
Leveraging industry-specific expertise to ensure specialized data labeling services.
Distribution Layer
Cloud-based Platform:
Providing a cloud-based platform for easy access to data labeling tools and services.
API Integration:
Offering API integration for seamless integration of data labeling into existing workflows.
Collaboration and Feedback:
Enabling collaboration and feedback loops between data labelers and customers.
Financial Layer
Usage-Based Pricing:
Charging customers based on the volume and complexity of data labeling tasks performed.
Enterprise Solutions:
Providing customized data labeling solutions for enterprise clients.
Data Marketplace:
Creating a marketplace for labeled data, connecting data providers and users.
Short history of Cohere
The origins of Cohere can be traced back to the 2017 paper Attention Is All You Need. The paper, which introduced a new type of neural network known as a transformer, was authored by Gomez and several other notable researchers such as Noam Shazeer and Niki Parmar.
Several years later, it became evident that transformers could be scaled up to create immense neural networks that performed well on language-related tasks. Gomez was inspired by this revelation and believed there was a commercialization opportunity.
In an interview with tech analysis and research firm Slator, Frosst explained Cohere’s business model: “We upfront the cost of creating these massive transformer neural networks and then we hook up companies to them and they pay for usage, so that is a win-win situation for everybody. Companies get access to a thing that they could not themselves build and we get to build a company that is adding value in some way.”
Today, Cohere AI’s mission is to become the default NLP toolkit for every type of developer.
What does Cohere AI offer?
Cohere sells an API that enables developers to solve any language-related problem with large neural networks and state-of-the-art language AI.
The company’s products are broadly categorized according to three different tasks.
1 – Text generation
- Summarize – an LLM-powered text summarizer that encapsulates the key points of a document and can be used at scale.
- Generate – a content creator that produces unique content for various purposes such as landing page, email, and product description copy, and
- Command Model – a text generation model that is trained to follow user commands. Command Model is Cohere’s flagship product and was awarded the most capable LLM in the world by Stanford University’s Holistic Evaluation of Language Models (HELM).
2 – Text classification
- Classify – Cohere’s text classification product enables customers to organize information to enhance content moderation, analysis, and chatbot experiences. For example, it can be used to tag inbound customer support requests or identify positive and negative social media sentiments.
3 – Text retrieval
- Embed – for ML teams who want to build their own applications. Embed can discover trends and is available in over 100 languages.
- Semantic Search – to build powerful search capabilities that find text, articles, and documents irrespective of the language used in the search query. Importantly, Semantic Search returns information based on the meaning of a query (and not just keywords).
- Rerank – a tool that provides a semantic boost to the search quality of any keyword or vector search system. In essence, Rerank analyzes search results from existing tools and ranks them based on semantic relevance. This delivers richer, more relevant results and requires minimal intervention or coding experience from the user.
How does Cohere make money?
Cohere offers free, rate-limited usage for developers who wish to learn or prototype and be part of a community. However, a fee is applicable for those who want to go into production, train custom models, access all endpoints, and receive enhanced customer support.
This fee is based on the concept of tokens, which are simply the parts of words, entire words, or punctuation marks LLMs use to understand information. Common words such as “water” will have their own unique token, while longer, less frequent words may need to be encoded into three or four tokens.
The number of tokens required also depends on the complexity of the text and, at present, Cohere’s models are limited to sequences of up to 4096 tokens.
The prices for each token depend on the product and whether the customer desires a default or custom model:
- Embed – 40 cents per 1 million tokens in a default model, or 80 cents for a custom model.
- Generate – $15 per 1 million tokens (default), $30 (custom).
- Classify – 20 cents per 1,000 classifications. Here, Cohere considers that each text input counts as one classification.
- Summarize – $15 per 1 million tokens, and
- Rerank – $1 per 1,000 search units. One search unit is equivalent to a query with up to 100 documents to be ranked.
Key takeaways:
- Cohere AI is an enterprise provider of high-performance large language models (LLMs) for contexts such as content generation, summarization, and search. The company was founded in 2019 by Aidan Gomez, Nick Frosst, and Ivan Zhang.
- Cohere sells an API that enables developers to solve any language-related problem with large neural networks and state-of-the-art language AI. The company’s products are broadly categorized according to text generation, retrieval, and classification.
- Cohere offers free, rate-limited usage for developers who wish to learn or prototype. However, a token-based fee is applicable for those who want to go into production, train custom models, access all endpoints, and receive enhanced customer support.
Key Highlights:
- Founders and Origins: Cohere AI, founded in 2019 by former Google Brain employees Aidan N. Gomez, Nick Frosst, and entrepreneur Ivan Zhang, focuses on providing high-performance large language models (LLMs) for various language-related tasks.
- Business Model Layers – Scale AI: Cohere AI’s business model is based on Scale AI, with multiple layers contributing to its operations and revenue generation.
- Foundational Layer – Data Annotation: Cohere AI offers data labeling and annotation services, utilizing both AI algorithms and human expertise to provide high-quality annotations. This forms the foundation for training machine learning models.
- Value Layer – High-Quality Annotations: The value layer involves delivering high-quality data annotations that enhance the accuracy and performance of machine learning models. This layer emphasizes accuracy, scalability, and domain expertise.
- Distribution Layer – Cloud Platform and Collaboration: Cohere AI provides a cloud-based platform with accessible data labeling tools and services. Additionally, they offer API integration for easy incorporation into existing workflows and encourage collaboration and feedback between data labelers and customers.
- Financial Layer – Usage-Based Pricing and Enterprise Solutions: The financial layer focuses on revenue generation through usage-based pricing. Customers are charged based on the volume and complexity of data labeling tasks. Cohere AI also offers enterprise solutions, catering to the specific needs of large clients.
- Partnership with Amazon AWS: Cohere AI partnered with Amazon AWS, making AWS its preferred cloud provider for building and scaling its AI models. This partnership likely includes cost-efficient computing infrastructure for the company.
- API Monetization: Cohere AI monetizes its open-source models through the DreamStudio API platform, offering computing resources and expertise to users wanting to access Stable Diffusion and other models via APIs.
- Consulting and Enterprise Services: Cohere AI offers consulting services, assisting enterprises in integrating AI into applications such as images, film, and VR. The company also provides enterprise services for customization and scalability.
- Diverse Product Offerings: Cohere AI’s products are categorized into text generation, classification, and retrieval, offering solutions such as summarization, content generation, command models, text classification, text embedding, semantic search, and reranking.
- Token-Based Fee Model: Cohere AI provides free, rate-limited usage for developers, while a token-based fee applies to those who want to use their API for production, train custom models, access all endpoints, and receive enhanced customer support. The token-based pricing varies depending on the product and model type.
Read Next: History of OpenAI, AI Business Models, AI Economy.
Connected Business Model Analyses
AI Paradigm





OpenAI Organizational Structure




Stability AI Ecosystem










