What is Hugging Face?

Hugging Face is an American developer of tools that enable users to build applications with machine learning. 

The company, which was founded in 2016 by French entrepreneurs Julien Chaumond, Thomas Wolf, and Clément Delangue, initially offered an app-based chatbot for the teenage demographic.

A short history of Hugging Face

The chatbot was designed as a fun and entertaining communicator that could talk about a diverse range of subjects but also detect emotions in the user’s text and alter its tone to suit. 

Underpinning this chatbot was an in-house natural language processing (NLP) model and also a library of pre-trained models. These would serve as the foundation for early PyTorch transformers which facilitated compatibility between PyTorch and TensorFlow 2.0.

When the seminal paper Attention Is All You Need was released in 2017, the use of transformers shifted to the NLP space. The company, which had previously released part of the library powering its chatbot on GitHub, then standardized the differences between models that were trained with different libraries.

What does Hugging Face offer?

Open-source platform

The company’s open-source platform, known as HF Hub, provides free access to anyone who wants to explore, experiment, collaborate, and build tech with ML. HF Hub offers over 60,000 models, 6,000 datasets, and 6,000 ML demo apps.

Inside HF Hub, users will be able to create or access these ML assets:

  • Models – initially, the hub was a repository for state-of-the-art NLP models but has since expanded to computer vision, speech, RL, chemistry, biology, and time-series models. Hugging Face incorporates transfer learning so that any team can use a pre-trained model and easily fine-tune it for their own purposes.
  • Datasets – HF Hub datasets feature a variety of data for various modalities, languages, and domains. Users can locate the best dataset for their use case by using the search bar, and several tools make data manipulation fast and efficient. 
  • Spaces – these are interactive, browser-based apps that enable users to showcase their models to business stakeholders in particular. 

The hub is also a space where individuals can solicit feedback on their models. Teams can also peer review each of the ML assets and create organizations where said assets can be managed collaboratively. 

Private Hub

The Hugging Face Private Hub is a place where companies can utilize the company’s complete ecosystem in a private and compliant environment. 

Private Hub was established in response to the fact that 87% of data science projects never make it into production, with the primary causes being siloed organizations, task and effort duplication, and data that lives in different formats.

To accelerate the ML development process and make collaboration simpler and more productive, Private Hub offers a unified set of tools from research to production. 

Data scientists can now work with over 25 open-source transformer libraries, with over 10,000 companies such as AWS, Intel, Microsoft, Meta AI, and Grammarly using the technology.

Subscription-based NLP features for deployment

While the HF Hub and its various features and tools are free, the company does offer certain upgrades and additional features for a price

Let’s take a look at a few of these below:

  • Spaces Hardware – for those who want to upgrade their Spaces with custom hardware, various fees are charged. A basic CPU upgrade, for example, is a mere $0.03 per hour, while an upgrade to the Nvidia A100 GPU is $4.13 per hour.
  • Inference Endpoints – a secure production solution that enables users to easily deploy any model on dedicated, autoscaling infrastructure. Various options exist. For Azure CPU instances on the Intel Xeon architecture, 8 vCPUs, and 16GB memory, Hugging Face charges $0.48 per hour.
  • Pro Account – for $9 per month, users with a Pro account receive a special badge, unlimited models and datasets, early access to new features, and higher limits for both the AutoTrain and Free Inference API features.
  • AutoTrain – a feature that creates, trains, evaluates and deploys state-of-the-art ML models without code. Free to use initially, but extra images, rows, and models are available for a price.
  • Enterprise – for enterprise customers who want to train their own LLMs, Hugging Face sells support from over 40 machine learning experts. Prices are available on request.

Key takeaways:

  • Hugging Face is a developer of tools that enable users to build applications with machine learning. The company, which was founded in 2016 by Julien Chaumond, Thomas Wolf, and Clément Delangue, was conceived initially to create an app-based chatbot for teenagers. 
  • The company’s open-source platform (HF Hub) provides free access to anyone who wants to explore, experiment, collaborate, and build tech with ML. HF Hub offers over 60,000 models, 6,000 datasets, and 6,000 ML demo apps. Private Hub is for clients who want to utilize the company’s complete ecosystem in a private and compliant environment. 
  • While the HF Hub and its various features and tools are free, the company does offer certain upgrades and additional features for a price. These include upgrades to Spaces, Inference Endpoints, and AutoTrain, as well as a premium support option for enterprises and a Pro account with access to various perks.

Key Highlights

  • Founding and Early Focus: Hugging Face is an American developer of tools for building applications with machine learning. It was founded in 2016 by French entrepreneurs Julien Chaumond, Thomas Wolf, and Clément Delangue. Initially, the company created an app-based chatbot designed for teenagers. The chatbot had the capability to discuss various topics and detect emotions in user input, adjusting its tone accordingly.
  • NLP Models and Libraries: Hugging Face’s chatbot was powered by an in-house natural language processing (NLP) model and a library of pre-trained models. This laid the foundation for the development of PyTorch transformers, which facilitated compatibility between the PyTorch and TensorFlow 2.0 libraries. The release of the paper “Attention Is All You Need” in 2017 marked a shift towards using transformers in the NLP field.
  • HF Hub – Open Source Platform: The company’s open-source platform, known as HF Hub, provides free access to a wide range of machine learning resources. It offers over 60,000 models, 6,000 datasets, and 6,000 ML demo apps. HF Hub covers various domains beyond NLP, including computer vision, speech, reinforcement learning, chemistry, biology, and time-series models. The platform incorporates transfer learning, allowing teams to fine-tune pre-trained models for their specific use cases.
  • Datasets and Spaces: HF Hub includes datasets for various modalities, languages, and domains. The platform enables efficient data manipulation and searching for suitable datasets. “Spaces” are interactive, browser-based apps that allow users to showcase their models, particularly to business stakeholders. The platform supports collaboration, feedback solicitation, and peer review of machine learning assets.
  • Private Hub for Enterprises: Hugging Face introduced the Private Hub to address challenges in data science projects not reaching production. Private Hub offers a unified environment for ML development, spanning research to production. It enables companies to work with Hugging Face’s ecosystem in a private and compliant manner. Notable companies like AWS, Intel, Microsoft, Meta AI, and Grammarly utilize this technology.
  • Subscription-based Features: While many features of HF Hub are free, Hugging Face also offers subscription-based upgrades and additional features. These include hardware upgrades for Spaces, secure production deployment with Inference Endpoints, Pro Accounts with various perks, AutoTrain for ML model creation, and Enterprise support for training custom language models (LLMs).
  • Pro Account and Upgrades: The Pro Account, priced at $9 per month, provides users with benefits like a special badge, unlimited access to models and datasets, early feature access, and increased limits for AutoTrain and Free Inference API features.
  • AutoTrain and Enterprise: AutoTrain is a feature that automates the creation, training, evaluation, and deployment of advanced ML models without requiring code. It is initially free, but additional resources come at a cost. For enterprise customers, Hugging Face offers support from a team of over 40 machine learning experts, and pricing details are available upon request.

Read Next: History of OpenAI, AI Business Models, AI Economy.

Connected Business Model Analyses

AI Paradigm

current-AI-paradigm

Pre-Training

pre-training

Large Language Models

large-language-models-llms
Large language models (LLMs) are AI tools that can read, summarize, and translate text. This enables them to predict words and craft sentences that reflect how humans write and speak.

Generative Models

generative-models

Prompt Engineering

prompt-engineering
Prompt engineering is a natural language processing (NLP) concept that involves discovering inputs that yield desirable or useful results. Like most processes, the quality of the inputs determines the quality of the outputs in prompt engineering. Designing effective prompts increases the likelihood that the model will return a response that is both favorable and contextual. Developed by OpenAI, the CLIP (Contrastive Language-Image Pre-training) model is an example of a model that utilizes prompts to classify images and captions from over 400 million image-caption pairs.

OpenAI Organizational Structure

openai-organizational-structure
OpenAI is an artificial intelligence research laboratory that transitioned into a for-profit organization in 2019. The corporate structure is organized around two entities: OpenAI, Inc., which is a single-member Delaware LLC controlled by OpenAI non-profit, And OpenAI LP, which is a capped, for-profit organization. The OpenAI LP is governed by the board of OpenAI, Inc (the foundation), which acts as a General Partner. At the same time, Limited Partners comprise employees of the LP, some of the board members, and other investors like Reid Hoffman’s charitable foundation, Khosla Ventures, and Microsoft, the leading investor in the LP.

OpenAI Business Model

how-does-openai-make-money
OpenAI has built the foundational layer of the AI industry. With large generative models like GPT-3 and DALL-E, OpenAI offers API access to businesses that want to develop applications on top of its foundational models while being able to plug these models into their products and customize these models with proprietary data and additional AI features. On the other hand, OpenAI also released ChatGPT, developing around a freemium model. Microsoft also commercializes opener products through its commercial partnership.

OpenAI/Microsoft

openai-microsoft
OpenAI and Microsoft partnered up from a commercial standpoint. The history of the partnership started in 2016 and consolidated in 2019, with Microsoft investing a billion dollars into the partnership. It’s now taking a leap forward, with Microsoft in talks to put $10 billion into this partnership. Microsoft, through OpenAI, is developing its Azure AI Supercomputer while enhancing its Azure Enterprise Platform and integrating OpenAI’s models into its business and consumer products (GitHub, Office, Bing).

Stability AI Business Model

how-does-stability-ai-make-money
Stability AI is the entity behind Stable Diffusion. Stability makes money from our AI products and from providing AI consulting services to businesses. Stability AI monetizes Stable Diffusion via DreamStudio’s APIs. While it also releases it open-source for anyone to download and use. Stability AI also makes money via enterprise services, where its core development team offers the chance to enterprise customers to service, scale, and customize Stable Diffusion or other large generative models to their needs.

Stability AI Ecosystem

stability-ai-ecosystem

About The Author

Scroll to Top
FourWeekMBA