Skip to content

Latest commit

 

History

History
47 lines (46 loc) · 25.7 KB

Topic_NVIDIA.md

File metadata and controls

47 lines (46 loc) · 25.7 KB

NVIDIA news

Title Summary Topics Week
OpenAI and rivals seek new path to smarter AI as current methods hit limitations 🟢 AI companies like OpenAI are innovating training techniques to overcome limits of large language models. Techniques behind OpenAI’s o1 model focus on human-like reasoning, shifting away from “bigger is better.” This approach may reshape AI resources and impact Nvidia’s dominance in AI chips. Rivals are also pursuing similar advancements. NVIDIA 🎮, AI Chips and GPUs 🖥️, OpenAI 🌟 2024-11-18
Amazon ready to use its own AI chips, reduce its dependence on Nvidia 🟢 Amazon is set to launch Trainium 2 AI chips, aiming to reduce reliance on Nvidia and cut costs for Amazon Web Services (AWS) customers. These chips promise efficiency and savings, attracting users like Anthropic and Databricks. This move highlights a growing trend of tech giants developing custom chips to drive AI growth. Amazon 🌳, NVIDIA 🎮, AI Chips and GPUs 🖥️, Anthropic 🌐 2024-11-18
Meta is using more than 100,000 Nvidia H100 AI GPUs to train Llama-4 🟢 Meta is utilizing over 100,000 Nvidia H100 AI GPUs to develop Llama 4, an advanced AI model with improved modalities and reasoning capabilities, positioning itself competitively against Microsoft and Google. Despite the significant power demands, Meta plans to release Llama models for free to encourage broader development and application. NVIDIA 🎮, AI Chips and GPUs 🖥️, LLaMA 🦙, Meta ♾, Model release 🎉 2024-11-11
Nvidia just dropped a new AI model on the level of OpenAI’s GPT-4 🟢 Nvidia has introduced the Llama-3.1-Nemotron-70B-Instruct AI model, outperforming GPT-4 in benchmarks. Built on Meta’s Llama 3.1, it features advanced training for enhanced language capabilities. This marks Nvidia’s strategic pivot from GPU manufacturing to AI software, potentially reshaping the industry with powerful, customizable AI solutions and challenging current AI leaders. Model release 🎉, NVIDIA 🎮, LLaMA 🦙, OpenAI 🌟 2024-10-21
Nvidia just dropped a bombshell: Its new AI model is open, massive, and ready to rival GPT-4 🟢 Nvidia released the new NVLM 1.0 family of large multimodal language models, led by the 72 billion parameter NVLM-D-72B. Multimodal AI (image, video, audio) 📸, Model release 🎉, NVIDIA 🎮, GPT-4 and GPT-4 turbo 🚀 2024-10-07
AMD is turning its back on flagship gaming GPUs to chase AI first 🟢 AMD is prioritizing AI development over flagship gaming GPUs to achieve a larger market share and attract developer support. According to Jack Huynh, the goal is to reach a 40% market share to compete with Nvidia and optimize AMD platforms for developers before potentially re-focusing on gaming GPUs. NVIDIA 🎮, AI Chips and GPUs 🖥️ 2024-09-16
Elon Musk is putting his AI chips to work — and he’s catching up with Mark Zuckerberg 🟢 Elon Musk’s xAI has launched Colossus, a major training cluster boasting 100,000 Nvidia H100 GPUs, making it the world’s most powerful AI system. Built in 122 days in Memphis and set to double in capacity soon, this development comes amid a global GPU shortage, with rivals like Meta and Microsoft also competing for AI supremacy. NVIDIA 🎮, AI Chips and GPUs 🖥️, Meta ♾, Microsoft 🪟 2024-09-09
Apple and Nvidia may invest in OpenAI 🟢 Apple and Nvidia are exploring investments in OpenAI to enhance their AI capabilities and maintain a competitive edge, though investment specifics remain undisclosed. Apple 🍏, Funding 💰, NVIDIA 🎮, OpenAI 🌟 2024-09-02
Introducing Cerebras Inference: AI at Instant Speed 🟢 Cerebras has achieved a significant speed advantage in AI language model inference, delivering 1,800 tokens per second on Llama3.1 8B and 450 tokens per second on Llama3.1 70B models, outperforms NVIDIA’s GPU-based solutions by 20-fold and is 2.4 times faster than Groq for the 8B model. Notably, Cerebras stands alone in offering immediate responses at a rate of 450 tokens per second on the 70B model. NVIDIA 🎮, AI Chips and GPUs 🖥️, LLaMA 🦙 2024-09-02
Nvidia unveils AI model StormCast for advanced weather prediction 🟢 Nvidia has launched StormCast, an AI-driven model on its Earth-2 platform, advancing mesoscale weather prediction with simulations of atmospheric dynamics. It achieves a 10% accuracy improvement over traditional six-hour forecasts, contributing to efficient disaster planning and positioning Nvidia alongside other tech giants like Google, Microsoft, and IBM in AI climate technology. NVIDIA 🎮, Google 🔍, Microsoft 🪟 2024-08-26
AMD is becoming an AI chip company, just like Nvidia 🟢 AMD’s Q2 2024 earnings highlight a strategic shift towards AI, with data center products like the Instinct MI300 accelerator leading sales, which have surged by 115%. The MI300 broke $1 billion in quarterly sales, indicating AMD’s intent to annually release AI chips to rival Nvidia’s market dominance. NVIDIA 🎮, AI Chips and GPUs 🖥️ 2024-08-12
Mistral NeMo 🟢 Mistral, in collaboration with NVIDIA, has launched the 12B parameter Mistral NeMo model, featuring a 128k token context window, FP8 inference compatibility, and a cutting-edge Tekken tokenizer. It is open-sourced under Apache 2.0, boasts enhanced multilingual capabilities, and outperforms the previous 7B version in instruction-following tasks. Model release 🎉, NVIDIA 🎮, Mistral 🌬️ 2024-07-22
Apple, Nvidia, Anthropic Used Thousands of Swiped YouTube Videos to Train AI 🔴 An investigation revealed that major AI firms, including Apple, Nvidia, and Anthropic, have trained their AI models using subtitles from over 173k YouTube videos, potentially breaching YouTube’s anti-data harvesting policy and raising issues on creators’ rights and compensation. AI and copyright ©️, AI datasets 📊, Apple 🍏, NVIDIA 🎮, Anthropic 🌐 2024-07-22
Elon Musk: Grok 2 AI Arrives in August 🟢 Elon Musk has unveiled plans for Grok 2, a new AI model expected in August 2024, promising enhanced efficiency. His company anticipates an upgrade to Grok 3 by the end of the same year, utilizing cutting-edge Nvidia GPU technology. Grok 🐦, NVIDIA 🎮, AI Chips and GPUs 🖥️ 2024-07-08
NVIDIA Releases Open Synthetic Data Generation Pipeline for Training Large Language Models 🟢 NVIDIA has launched Nemotron-4 340B, an open suite of models designed to create synthetic data for training language models in diverse sectors. The suite, which includes base, instruct, and reward models, focuses on improving the quality and availability of training data. It is optimized for NVIDIA NeMo and TensorRT-LLM, providing support for more efficient training and inference of LLMs. AI datasets 📊, NVIDIA 🎮 2024-06-24
Nvidia shipped 3.76M data center GPUs in 2023 — dominates business with 98% revenue share 🟢 In 2023, Nvidia consolidated its position in the data center GPU market with a 98% share by distributing 3.76 million units and achieved a remarkable 126% revenue increase since 2020, reaching $60.9 billion, even amidst U.S. export restrictions and manufacturing hurdles. NVIDIA 🎮, AI Chips and GPUs 🖥️ 2024-06-17
Nvidia is now more valuable than Apple at $3.01 trillion 🟢 Nvidia has achieved a market capitalization of $3.01 trillion, propelled by the artificial intelligence surge, overtaking Apple to become the world’s second most valuable company. Apple 🍏, NVIDIA 🎮 2024-06-10
AMD unveils new AI chips to compete with Nvidia 🟢 AMD is challenging Nvidia’s leadership in AI with upcoming releases: the MI325X in 2024, and the MI350/MI400 series in 2025–2026, promising notable performance boosts to satisfy increasing AI demands. NVIDIA 🎮, AI Chips and GPUs 🖥️ 2024-06-10
Nvidia and Salesforce may double down on AI startup Cohere in $450 million round 🟢 Generative AI startup Cohere has secured a $450 million funding round led by Nvidia and Salesforce, alongside new backers such as Cisco and PSP Investments, boosting its valuation to $5 billion from its prior $2.2 billion mark. The company also disclosed an annualized revenue of $35 million. Funding 💰, NVIDIA 🎮 2024-06-10
Nvidia Stock Surges as Sales Forecast Delivers on AI Hopes 🟢 Nvidia’s stock surged 9.3% after a promising sales forecast, pointing to a robust demand for AI technologies. The $28 billion projected Q2 revenue exceeds expectations, highlighting the company’s strong position in the AI market, buoyed by their new Blackwell chips and significant data-center revenue. NVIDIA 🎮, AI Chips and GPUs 🖥️ 2024-05-27
Google’s new chips look to challenge Nvidia, Microsoft and Amazon 🟢 Google has unveiled the Cloud TPU v5p, an AI chip that delivers nearly triple the training speed of its predecessor, the TPU v4, reinforcing its position in AI services and hardware. At the Google Cloud Next event, CEO Pichai highlighted the company’s AI advancements and collaborations, including the use of the A3 supercomputer and Blackwell chips in the AI Hypercomputer. Additionally, Google introduced the Google Axion CPU, an Arm-based processor that competes with similar offerings from Microsoft and Amazon, boasting a 30% performance improvement and better energy efficiency. Amazon 🌳, NVIDIA 🎮, AI Chips and GPUs 🖥️, Google 🔍, Microsoft 🪟 2024-04-23
Lambda Announces $500M GPU-Backed Facility to Expand Cloud for AI 🟢 Lambda has successfully secured $500 million in funding to enhance its AI-oriented cloud services, powered by NVIDIA GPUs, following a Series C investment round. Funding 💰, NVIDIA 🎮, AI Chips and GPUs 🖥️ 2024-04-08
‘We Created a Processor for the Generative AI Era,’ NVIDIA CEO Says 🟢 NVIDIA CEO Jensen Huang announced the NVIDIA Blackwell computing platform at the GTC conference, aimed at advancing generative AI with superior training and inference capabilities. The platform includes enhanced interconnects for better performance and scalability. NVIDIA also launched NIM microservices for tailored AI deployment and Omniverse Cloud APIs for sophisticated simulation, signaling a transformative impact on sectors like healthcare and robotics. AI in healthcare 🏥, Robotics 🤖, NVIDIA 🎮, AI Chips and GPUs 🖥️ 2024-03-25
Nvidia’s data center GPU sales grow by a stunning 409% on huge demand for AI chips 🟢 Nvidia has experienced a significant surge in GPU sales, reporting a 409% increase due in large part to the rising demand for AI technologies. With Q4 earnings and revenue significantly outpacing Wall Street forecasts, the company’s financials have thrived on the back of robust sales from their Hopper GPU series, including the H100. NVIDIA 🎮, AI Chips and GPUs 🖥️ 2024-02-26
GPU cloud company Together AI to raise $100m 🟢 Together AI, a GPU cloud company specializing in open-source AI tools and Nvidia server chip access, is nearing a $100 million funding round led by Salesforce Ventures, potentially elevating its valuation to $1 billion. Funding 💰, NVIDIA 🎮, AI Chips and GPUs 🖥️ 2024-02-19
Hugging Face and Google partner for open AI collaboration 🟢 Hugging Face has partnered with Google Cloud, providing users with access to enhanced AI models and integration with Google Cloud services like GKE and Vertex AI, utilizing Google TPUs and NVIDIA H100 GPUs. Hugging Face 🤗, NVIDIA 🎮, AI Chips and GPUs 🖥️, Google 🔍 2024-01-29
Mark Zuckerberg indicates Meta is spending billions of dollars on Nvidia AI chips 🟢 Meta plans a significant investment in AI research by integrating 350,000 Nvidia H100 GPUs by 2024. Given their high cost — estimated between $25K-$30K — this investment underlines Meta’s commitment to scaling up computing power. Overall, Meta’s strategy to amass the computational equivalent of 600K H100 GPUs highlights a substantial push to enhance its AI capabilities. NVIDIA 🎮, AI Chips and GPUs 🖥️, Meta ♾ 2024-01-22
Nvidia unveils H200, its newest high-end chip for training AI models 🟢 Nvidia introduces the H200 GPU, an upgraded version with 141GB of high-bandwidth memory. This enhancement helps with the inference process in training large AI models. Set to be released in Q2 2024, the H200 will compete with AMD’s MI300X GPU, which also offers increased memory capacity for handling big models. NVIDIA 🎮, AI Chips and GPUs 🖥️ 2023-11-20
NVIDIA makes Pandas much faster leveraging GPUs 🟢 NVIDIA has significantly enhanced the Pandas library, achieving up to 150 times faster performance by capitalizing on GPUs. With the new cudf.pandas module, operations are seamlessly executed on either the GPU or CPU, providing automatic synchronization and efficient switching between the two. NVIDIA 🎮, AI Chips and GPUs 🖥️ 2023-11-13
New AWS service lets customers rent Nvidia GPUs for quick AI projects 🟢 AWS has introduced Amazon Elastic Compute Cloud (EC2) Capacity Blocks for ML, enabling AI practitioners to reserve Nvidia GPUs for specific time periods. This new service grants access to Nvidia H100 Tensor Core GPU instances, allowing users to reserve instances for up to 14 days ahead of time, with shutdowns scheduled automatically. Amazon 🌳, NVIDIA 🎮, AI Chips and GPUs 🖥️ 2023-11-13
New AWS service lets customers rent Nvidia GPUs for quick AI projects 🟢 AWS launches Amazon Elastic Compute Cloud (EC2) Capacity Blocks for ML, allowing customers to rent Nvidia GPUs for specific time frames. Amazon 🌳, NVIDIA 🎮, AI Chips and GPUs 🖥️ 2023-11-06
Microsoft to Unveil In-House AI Chip, Reducing Reliance on NVIDIA 🔴 Microsoft is soon launching its own AI chip called Athena, aiming to reduce reliance on NVIDIA GPUs and compete against NVIDIA’s H100 GPU for AI acceleration in data centers. NVIDIA 🎮, AI Chips and GPUs 🖥️, Microsoft 🪟 2023-10-10
LLM Startup Embraces AMD GPUs, Says ROCm Has ‘Parity’ With Nvidia’s CUDA Platform 🟢 A startup called Lamini is using over 100 AMD Instinct MI200 GPUs and found that AMD’s ROCm software platform rivals Nvidia’s CUDA platform. They claim that running a large language model on their platform is 10x cheaper than on Amazon Web Services. AI Chips and GPUs 🖥️, NVIDIA 🎮 2023-10-02
Open ASR Leaderboard 🟢 Hugging Face has released a speech-to-text leaderboard that ranks and assesses speech recognition models on their platform. The current top performers are NVIDIA FastConformer and OpenAI Whisper, with a focus on English speech recognition. Multilingual evaluation will be added in future updates. Hugging Face 🤗, Speech-to-text 🎤, NVIDIA 🎮, OpenAI 🌟 2023-09-11
NVIDIA and Hugging Face to Connect Millions of Developers to Generative AI Supercomputing 🟢 NVIDIA and Hugging Face have partnered to offer AI developers access to high-performance GPUs for deep learning through DGX Cloud integration. Hugging Face 🤗, NVIDIA 🎮, AI Chips and GPUs 🖥️ 2023-08-14
Cerebras Systems signs $100 million AI supercomputer deal with UAE’s G42 🟢 Cerebras Systems has struck a $100 million deal with G42, marking the debut of AI supercomputers that could potentially challenge Nvidia’s market position. In response to chip shortages, cloud computing providers are seeking alternative solutions. To accelerate the rollout, Cerebras will construct three Condor Galaxy systems in the United States, with the first supercomputer set to go online this year, followed by two others in early 2024. Funding 💰, NVIDIA 🎮, AI Chips and GPUs 🖥️ 2023-07-24
Nvidia deepens bets on AI in drug discovery with Recursion investment 🟢 Nvidia invests $50M in Recursion, a biotech company using AI to revolutionize drug discovery. This partnership allows Recursion to utilize Nvidia’s platform and access their advanced AI technology. Recursion’s share price surged by 83% post-announcement, highlighting the market’s recognition of AI’s significance in drug discovery. AI in healthcare 🏥, Funding 💰, NVIDIA 🎮, AI Chips and GPUs 🖥️ 2023-07-17
Inside China’s underground market for high-end Nvidia AI chips 🔴 Despite U.S. export restrictions, Chinese demand for high-end Nvidia AI chips remains strong and vendors in Shenzhen and Hong Kong have emerged to offer them at steep prices in a de facto underground market. The exact volume of chip flow is unknown, but local authorities are also reported to be buyers. NVIDIA 🎮, AI Chips and GPUs 🖥️, AI regulation 📜 2023-06-26
China’s ByteDance Has Gobbled Up $1 Billion of Nvidia GPUs for AI This Year 🟢 Chinese tech giant ByteDance has reportedly invested $1 billion in Nvidia’s HPC products, including the A100 and H800 cards, totaling about 100,000 units, to fulfill the high demand for AI technology. China recognizes the impact of AI on the global market and has embraced it, investing heavily in HPC hardware. NVIDIA 🎮, AI Chips and GPUs 🖥️ 2023-06-26
Nvidia demo about speaking to AI game characters 🟢 Nvidia showcases a powerful AI-powered demo of conversational AI for game characters that enhances realism and engages players, providing game developers a tool to improve their games’ storytelling and overall engagement. NVIDIA 🎮, Multimodal AI (image, video, audio) 📸, Text-to-speech 📢, Speech-to-text 🎤 2023-06-06
Why Nvidia is suddenly one of the most valuable companies in the world 🟢 NVIDIA’s GPUs have become a crucial component in developing AI, leading its worth to $939.3 billion; with AI applications requiring huge amounts of data, companies are buying thousands of NVIDIA’s expensive chips. NVIDIA’s dominance in the industry is predicted to persist as startups compete with big tech for access to its costly GPUs. NVIDIA 🎮, AI Chips and GPUs 🖥️ 2023-06-06
Microsoft spent hundreds of millions of dollars on a ChatGPT supercomputer 🟢 Microsoft says it connected tens of thousands of Nvidia A100 chips and reworked server racks to build the hardware behind ChatGPT and its own Bing AI bot. NVIDIA 🎮, AI Chips and GPUs 🖥️, Microsoft 🪟, ChatGPT 💬 2023-03-20
Nvidia Debuts Generative AI Model for Medical Research 🟢 Nvidia and Evozyne have debuted ProT-VAE, a generative AI model capable of producing proteins to use in medicine and other industries. The model uses deep learning and generative AI to synthesize variants of proteins to create more effective enzymes. AI in healthcare 🏥, NVIDIA 🎮 2023-01-23