To make it easier for developers to create AI-powered applications with world-class performance, NVIDIA and Google today announced three new collaborations at Google I/O ’24. Using TensorRT-LLM, NVIDIA is working with Google to optimize two new models Google introduced at the event: Gemma 2 and PaliGemma. These models are built from the same research and […]
Amazon Adds $2.75B to Stake in GenAI Startup Anthropic
Amazon announced it has made its biggest-ever investment, $2.75 billion, in OpenAI/ChatGPT competitor Anthropic, another indication that the generative AI phenomenon continues to heat up. Today’s news follows Amazon and Anthropic announcing an earlier $1.25 billion investment last September; today’s announcement brings the total investment to $4 billion. “We have a notable history with […]
Oriole Networks Raises £10m for Faster LLM Training
London, 27 March 2024: Oriole Networks – a startup using light to train LLMs faster with less power – has raised £10 million in seed funding to improve AI performance and adoption, and solve AI’s energy problem. The round, which the company said is one of the UK’s largest seed raises in recent years, was co-led […]
Datasaur Launches LLM Lab for ChatGPT and Similar Models
Oct. 27, 2023 — Datasaur, a natural language processing (NLP) data-labeling platform, today launched LLM Lab, an interface designed for data scientists and engineers to build and train custom LLM models like ChatGPT. The product will provide a wide range of features for users to test different foundation models, connect to their own internal documents, […]
Hyperion: HPC Community’s Interest in LLMs Has ‘Exploded,’ with Complexity, Cost Concerns
HPC-AI industry analyst firm Hyperion Research said its new study on large language models in the HPC community shows that interest in LLMs has exploded in the last six months, driven by the technology’s unique capabilities to answer queries, generate concise summaries, and even produce unique works of fiction….
Large Language Models: The Largeness, the Power and the ‘Emergent’ Mystery
Large language models fit the classic model of a red-hot technology in an early stage of commercial viability: there’s more talk about them than knowledge, and FOMO – the fear that your competitors are implementing them at your peril – is helping to drive explosive demand. There’s also an allure and mystery around LLMs: some of the awe-inspiring “zero shot” things they do surprise even the data scientists who trained the models (more on this below). Against this backdrop….