Large Language Model

A large language model (LLM) is a language model (a computing algorithm created typically for speech recognition and language generation) consisting of an artificial neural network with billions or trillions of parameters, trained on billions of words of text using self-learning.

Neural language models such as ChatGPT (created by OpenAI) are found to capture much of the syntax and semantics of human language, demonstrate considerable general knowledge about the world, and are able to "memorize" a great quantity of facts during training. As such, ChatGPT is regarded as an artificial intelligence chatbot. It uses a generative pre-trained transformer (GPT), a type of LLM, currently GPT-4. ChatGPT initially used a Microsoft Azure supercomputing infrastructure powered by Nvidia GPUs costing hundreds of millions of dollars.

ChatGPT can write and debug computer programs, mimic people, compose music, plays, stories and student essays, answer test questions, write poetry and song lyrics, and translate and summarize text. It can even be taught to play games.

The power of ChatGPT is revolutionary and it may be a precursor to an Artificial General Intelligence or Superintelligence. Currently there are concerns with it being an existential threat.