News “A really big deal”—Dolly is a free, open source, ChatGPT-style AI model

The Helper

Necromancy Power over 9000
Staff member
Reaction score
1,698
dolly_hero.jpg

Dolly 2.0 could spark a new wave of fully open source LLMs similar to ChatGPT.

On Wednesday, Databricks released Dolly 2.0, reportedly the first open source, instruction-following large language model (LLM) for commercial use that has been fine-tuned on a human-generated data set. It could serve as a compelling starting point for homebrew ChatGPT competitors.

Databricks is an American enterprise software company founded in 2013 by the creators of Apache Spark. They provide a web-based platform for working with Spark for big data and machine learning. By releasing Dolly, Databricks hopes to allow organizations to create and customize LLMs "without paying for API access or sharing data with third parties," according to the Dolly launch blog post.

Dolly 2.0, its new 12 billion-parameter model, is based on EleutherAI's pythia model family and exclusively fine-tuned on training data (called "databricks-dolly-15k") crowdsourced from Databricks employees. That calibration gives it abilities more in line with OpenAI's ChatGPT, which is better at answering questions and engaging in dialogue as a chatbot than a raw LLM that has not been fine-tuned.

Dolly 1.0, released in March, faced limitations regarding commercial use due to the training data, which contained output from ChatGPT (thanks to Alpaca) and was subject to OpenAI's terms of service. To address this issue, the team at Databricks sought to create a new data set that would allow commercial use.

 

Attachments

  • dolly_hero.jpg
    dolly_hero.jpg
    48.3 KB · Views: 38
General chit-chat
Help Users
  • No one is chatting at the moment.

      The Helper Discord

      Members online

      Affiliates

      Hive Workshop NUON Dome World Editor Tutorials

      Network Sponsors

      Apex Steel Pipe - Buys and sells Steel Pipe.
      Top