<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Headline Edit</title>
    <description>Your weekly briefing on the most important news, informed by conversations with founders and industry leaders, and trusted by top executives at Fortune 500 companies.</description>
    
    <link>https://edit.headline.com/</link>
    <atom:link href="https://rss.beehiiv.com/feeds/nos8BOBDcf.xml" rel="self"/>
    
    <lastBuildDate>Thu, 16 Apr 2026 19:01:02 +0000</lastBuildDate>
    <pubDate>Tue, 17 Dec 2024 01:33:42 +0000</pubDate>
    <atom:published>2024-12-17T01:33:42Z</atom:published>
    <atom:updated>2026-04-16T19:01:02Z</atom:updated>
    
      <category>Venture Capital</category>
      <category>News</category>
      <category>Artificial Intelligence</category>
    <copyright>Copyright 2026, Headline Edit</copyright>
    
    <image>
      <url>https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/publication/logo/15e8942c-5b23-4f88-9e3b-405cd7e39417/h-logo.png</url>
      <title>Headline Edit</title>
      <link>https://edit.headline.com/</link>
    </image>
    
    <docs>https://www.rssboard.org/rss-specification</docs>
    <generator>beehiiv</generator>
    <language>en-us</language>
    <webMaster>support@beehiiv.com (Beehiiv Support)</webMaster>

      <item>
  <title>AI: OpenAI &amp; Google ship big updates, World Labs&#39; 3D/spatial demo, Liquid challenges transformers, and breaking down the data scarcity hype... (12.16.24)</title>
  <description>OpenAI, Google, World Labs, Liquid AI</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/0db820a7-9f0e-48a1-9527-c4ad3f75a9fd/DALL_E_2024-12-16_15.59.08_-_A_computer_screen_generating_3D_ships_dynamically_sailing_outward_from_the_screen__with_the_ships_appearing_to_come_alive_as_they_emerge._The_ships_ar.jpg" length="588344" type="image/jpeg"/>
  <link>https://edit.headline.com/p/ai-openai-google-are-shipping-big-updates-the-rise-of-a-transformer-challenger-and-the-data-scarcity</link>
  <guid isPermaLink="true">https://edit.headline.com/p/ai-openai-google-are-shipping-big-updates-the-rise-of-a-transformer-challenger-and-the-data-scarcity</guid>
  <pubDate>Tue, 17 Dec 2024 01:33:42 +0000</pubDate>
  <atom:published>2024-12-17T01:33:42Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><p class="paragraph" style="text-align:left;">The last few weeks have been packed with exciting releases, and in this edition, we’ll dive into the major updates, along with a short thought piece on a persistent narrative: have we really “run out of data,” and is AI progress slowing down as a result?</p><p class="paragraph" style="text-align:left;">Happy Holidays!</p><p class="paragraph" style="text-align:left;">Sasha Krecinic</p><hr class="content_break"><p id="is-ai-progress-starting-to-slow-dow" class="paragraph" style="text-align:left;"><span style="font-size:1.5rem;"><b>Is AI progress starting to &#39;slow down&#39; because we have &#39;consumed&#39; all of the data?</b></span></p><p class="paragraph" style="text-align:left;"><b>Short Answer:</b> No.</p><p class="paragraph" style="text-align:left;"><b>Long Answer:</b> AI&#39;s momentum is parallelized now, meaning there are several viable pathways to explore, both on the research and scaling side. Yes, some models are huge and underperform expectations, and some models aren&#39;t released due to competitive concerns or safety and alignment concerns. However, it isn’t wise to judge progress based on headlines or one variable like ‘total publicly available data’.</p><p class="paragraph" style="text-align:left;">Unfortunately, the headlines have focused on the fact that industry figures like Ilya Sutskever and other researchers have stated that the era of pre-training is coming to a close. However, the best way to track the frontier is to follow the research developments. 
What they might not mention is <a class="link" href="https://www.reuters.com/technology/artificial-intelligence/openai-rivals-seek-new-path-smarter-ai-current-methods-hit-limitations-2024-11-11/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">Ilya also recently said, &quot;Scaling the right thing matters more now than ever.&quot;</a> His point is that there are several different doors to explore, and some might be dead ends, hence the focus on parallelization among the major AI labs. Both <a class="link" href="https://www.youtube.com/live/-cq3O4t0qQc?t=723s&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">Sam Altman </a>and <a class="link" href="https://www.youtube.com/watch?v=ugvHCXCOmm4&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">Dario Amodei </a>have stated that they have a clear line of sight on where to build for the next 18-24 months. Beyond that, it is arguably hard to plan because the frontier is moving so quickly.</p><p class="paragraph" style="text-align:left;"><b>Why are people saying this? </b><br>These comments are usually taken out of context. They often reflect one small part of the picture and, sadly, can be quite misleading at times. Some of the biggest developments have occurred in the last two months and even came sooner than many in the field expected. 
Here are a few examples: <br>- <a class="link" href="https://openai.com/index/learning-to-reason-with-llms/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">Test time training/compute</a><br>- <a class="link" href="https://openai.com/index/introducing-the-realtime-api/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">Real-time voice APIs</a> / <a class="link" href="https://aistudio.google.com/live?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">Live AI screen vision </a> <br>- <a class="link" href="https://www.anthropic.com/news/3-5-models-and-computer-use?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">AI Computer Use</a><br><br>Each of these has the potential to transform industries and the overall surface area AI is touching here is expanding much faster than the data is being ‘consumed’. The thing that stumps most people in this industry is how little coverage these developments have gotten. 
So when someone says AI progress is &quot;losing steam,&quot; ask them what they think about the research pathways and how quickly AI’s surface area is expanding… </p><p class="paragraph" style="text-align:left;">[<a class="link" href="https://www.linkedin.com/feed/update/urn:li:activity:7263656881494081536/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">Why AI is not losing steam...</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20AI%20Development%20Accelerates%20with%20Multiple%20Research%20Pathways%20%20&body=%0A%0A%0AAI%20Development%20Accelerates%20with%20Multiple%20Research%20Pathways%20%20%0AFirst%20impacted%3A%20%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AIs%20AI%20progress%20starting%20to%20%27slow%20down%27%3F%20Have%20we%20%27tapped%20out%27%20all%20the%20data%3F%0A%0AShort%20Answer%3A%20lol%2C%20no%20%F0%9F%98%82%20%0A%0ALong%20Answer%3A%20AI%27s%20momentum%20is%20parallelized%20now%2C%20meaning%20there%20are%20several%20viable%20pathways%20to%20build%20on%20(both%20on%20the%20research%20and%20scaling%20side).%20Some%20models%20underperform%20expectations%2C%20and%20some%20models%20aren%27t%20released%20due%20to%20competitive%20concerns%20or%20safety%20and%20alignment%20concerns.%20You%20cannot%20judge%20progress%20based%20on%20consumer-facing%20product%20releases.%20The%20best%20way%20to%20track%20the%20frontier%20is%20through%20research%20developments.%20As%20Ilya%20said%2C%20%22Scaling%20the%20right%20thing%20matters%20more%20now%20than%20ever%2C%22%20hence%20the%20focus%20on%20parallelization.%20Both%20OpenAI%20and%20Anthropic%20have%20said%20they%20have%20a%20clear%20line%20of%20sight%20on%20where%20to%20build%20for%20the%20next%2018-24%20months.%20Beyond%20that%2C%20it%20is%20hard%20to%20plan%20because%20the%20frontier%20
is%20actually%20moving%20so%20quickly%20(except%20for%20things%20like%20large%20infrastructure%20projects%20which%20can%20have%20long%20lead%20times).%0A%0AWhy%20are%20people%20saying%20this%20then%3F%0AThese%20comments%20are%20usually%20taken%20out%20of%20context.%20They%20often%20reflect%20one%20small%20part%20of%20the%20picture%20and%20can%20be%20misleading.%20A%20quick%20litmus%20test%20is%20to%20ask%20if%20they%20know%20what%20Arxiv%20is%20(https%3A%2F%2Flnkd.in%2FgZWd7gwY)%20or%20what%20the%20most%20recent%20research%20paper%20they%20read%20was.%20If%20they%20can%27t%20answer%2C%20it%27s%20unlikely%20they%20are%20tracking%20the%20broader%20AI%20landscape.%0A%0A%22Not%20much%20has%20happened%20since%20ChatGPT%22%20%E2%80%94%20what%20do%20you%20say%20to%20this%3F%0ASome%20of%20the%20biggest%20developments%20have%20occurred%20in%20the%20last%20two%20months%20and%20came%20sooner%20than%20many%20in%20the%20field%20expected.%20Here%20are%20a%20few%20examples%3A%0A-%20Test%20time%20training%2Fcompute%3A%20https%3A%2F%2Flnkd.in%2FgSeFqG4b%0A-%20Real-time%20voice%20API%3A%20https%3A%2F%2Flnkd.in%2FgK8bKeEK%0A-%20Computer%20Use%3A%20https%3A%2F%2Flnkd.in%2Fgn_8f222%0A%0AEach%20of%20these%20has%20the%20potential%20to%20transform%20industries.%20The%20thing%20that%20stumps%20most%20people%20in%20this%20industry%20is%20how%20little%20coverage%20these%20developments%20have%20gotten.%20%0A%0ASo%20when%20someone%20says%20AI%20progress%20is%20%22losing%20steam%2C%22%20ask%20them%20what%20research%20papers%20they%20read%20to%20form%20this%20opinion...%20%F0%9F%99%83%0A%0A%20%5BAI%20losing%20steam%20due%20to%20data%20running%20out...%20https%3A%2F%2Fwww.linkedin.com%2Ffeed%2Fupdate%2Furn%3Ali%3Aactivity%3A7263656881494081536%2F%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="open-a-is-12-days-of-product-shipma"><b><a class="link" 
href="https://openai.com/12-days/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">OpenAI’s 12 days of product ship-mass</a></b></h3><p class="paragraph" style="text-align:left;">In case you missed it, OpenAI’s 12 Days of back-to-back releases (so far):</p><ul><li><p class="paragraph" style="text-align:left;"><b>Day 1:</b> o1 & ChatGPT Pro (Premium features with $20-$200 plans)</p></li><li><p class="paragraph" style="text-align:left;"><b>Day 2:</b> Reinforcement Fine-Tuning Program (Research applications open) </p></li><li><p class="paragraph" style="text-align:left;"><b>Day 3:</b> Sora (Video generation and remixing)</p></li><li><p class="paragraph" style="text-align:left;"><b>Day 4:</b> Canvas (Collaborative coding and writing)</p></li><li><p class="paragraph" style="text-align:left;"><b>Day 5:</b> ChatGPT in Apple Intelligence (Ecosystem integration)</p></li><li><p class="paragraph" style="text-align:left;"><b>Day 6:</b> Advanced Voice & Santa Mode (Voice+video and festive fun)</p></li><li><p class="paragraph" style="text-align:left;"><b>Day 7:</b> Projects in ChatGPT (Organize and manage projects)</p></li><li><p class="paragraph" style="text-align:left;"><b>Day 8:</b> ChatGPT Search (Real-time web answers with links)</p></li></ul><p class="paragraph" style="text-align:left;">Some of these were expected, and others have been a complete surprise to me, like the Reinforcement Fine-Tuning Program. It’s a big development because it lets developers shape model behavior through iterative feedback loops rather than being stuck with static datasets. 
Traditional fine-tuning methods rely on adjusting model parameters with fixed training examples, but RL FT incorporates evaluative signals—like user feedback or predefined reward criteria—directly into the training process. This makes it possible to optimize a model’s responses toward desired outcomes more dynamically, improving its ability to handle complex tasks, follow specific instructions, and maintain quality and alignment over time! </p><p class="paragraph" style="text-align:left;">[<a class="link" href="https://openai.com/12-days/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">12 Days of OpenAI </a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20ChatGPT%20Search%20launches%20for%20timely%20web-based%20answers%20%20&body=%0A%0A%0AChatGPT%20Search%20launches%20for%20timely%20web-based%20answers%20%20%0AFirst%20impacted%3A%20%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AOpenAI%20has%20launched%20ChatGPT%20Search%20in%20October%202024%2C%20a%20feature%20that%20provides%20quick%20answers%20using%20relevant%20web%20sources%20and%20includes%20clear%20links%20to%20these%20sources.%20This%20launch%20follows%20the%20introduction%20of%20Canvas%20for%20collaborative%20writing%20and%20coding%2C%20and%20Sora%20for%20video%20creation%2C%20both%20designed%20to%20enhance%20creative%20expression%20and%20storytelling.%0A%0A%20%5B12%20Days%20of%20OpenAI%0A%20https%3A%2F%2Fopenai.com%2F12-days%2F%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="google-launches-gemini-20-ai-model"><b><a class="link" 
href="https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24#ceo-message" target="_blank" rel="noopener noreferrer nofollow">Google Launches Gemini 2.0 AI Model</a></b></h3><p class="paragraph" style="text-align:left;">Google has launched Gemini 2.0, described in its recent blog post as an “AI model for the agentic era.” The model offers advanced multimodal capabilities and native tool use (e.g., search), with an experimental version called Gemini 2.0 Flash now available to developers, reportedly doubling the speed of its predecessor. Potentially most impressive, Google AI Studio introduces a screen-sharing capability that lets users start a live video session and receive real-time assistance with anything on their screen. Google also announced it is testing browser control for tasks like collecting contact information from web pages via a chat-based control panel. 
</p><p class="paragraph" style="text-align:left;">[<a class="link" href="https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24#ceo-message" target="_blank" rel="noopener noreferrer nofollow">Introducing Gemini 2.0: our new AI model for the agentic era</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20Google%20Launches%20Gemini%202.0%20AI%20Model%20%20&body=%0A%0A%0AGoogle%20Launches%20Gemini%202.0%20AI%20Model%20%20%0AFirst%20impacted%3A%20%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AGoogle%20and%20Alphabet%20have%20launched%20Gemini%202.0%2C%20an%20AI%20model%20designed%20for%20the%20agentic%20era%2C%20according%20to%20a%20blog%20post.%20They%20say%20this%20model%20offers%20advanced%20multimodal%20capabilities%20and%20native%20tool%20use%2C%20with%20an%20experimental%20version%20called%20Gemini%202.0%20Flash%20now%20available%20to%20developers%2C%20reportedly%20doubling%20the%20speed%20of%20its%20predecessor.%0A%0A%20%5BIntroducing%20Gemini%202.0%3A%20our%20new%20AI%20model%20for%20the%20agentic%20era%20https%3A%2F%2Fblog.google%2Ftechnology%2Fgoogle-deepmind%2Fgoogle-gemini-ai-update-december-2024%2F%23ceo-message%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="world-labs-spatial-intelligence-is-"><b><a class="link" href="https://www.worldlabs.ai/blog?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">World Labs: Spatial intelligence is on the horizon</a></b></h3><p class="paragraph" style="text-align:left;">A 
major step forward for spatial intelligence: World Labs has introduced an AI system capable of generating interactive 3D worlds from a single 2D image. The company raised $230M in September 2024 and is already showcasing impressive demos. Unlike conventional tools that produce static visuals, this new technology allows users to fully explore scenes, peering around corners and examining details in real time. Early demos show how the tool can transform creative workflows for artists, filmmakers, and game developers, offering unprecedented control and fidelity in digital environments. Check out the impressive demo here: </p><p class="paragraph" style="text-align:left;">[<a class="link" href="https://www.worldlabs.ai/blog?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">3D AI worlds coming soon</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%203D%20AI%20worlds%20coming%20soon&body=%0A%0A%0A3D%20AI%20worlds%20coming%20soon%0AFirst%20impacted%3A%20%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AIn%20a%20major%20step%20forward%20for%20spatial%20intelligence%2C%20World%20Labs%20has%20introduced%20an%20AI%20system%20capable%20of%20generating%20interactive%203D%20worlds%20from%20a%20single%202D%20image.%20Unlike%20conventional%20tools%20that%20produce%20static%20visuals%2C%20this%20new%20technology%20allows%20users%20to%20fully%20explore%20scenes%2C%20peering%20around%20corners%20and%20examining%20details%20in%20real-time.%20Early%20demos%20show%20how%20the%20tool%20can%20transform%20creative%20workflows%20for%20artists%2C%20filmmakers%2C%20and%20game%20developers%2C%20offering%20unprecedented%20control%20and%20fidelity%20in%20digital%20environments.%0A%0A%20%5B3D%20AI%20worlds%20coming%20soon%20https%3A%2F%2Fwww.worldlabs.ai%2Fblog%5D" target="_blank" 
rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="liquid-ai-secures-250-m-to-expand-a"><b><a class="link" href="https://www.liquid.ai/blog/we-raised-250m-to-scale-capable-and-efficient-general-purpose-ai?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">Liquid AI secures $250M to expand AI model infrastructure</a></b></h3><p class="paragraph" style="text-align:left;">Liquid AI has raised $250 million in a Series A round led by AMD Ventures to advance its Liquid Foundation Models, which are lightweight and general-purpose. The company says it will use the funds to enhance its computing infrastructure, expedite product readiness for edge and on-premise applications and fine-tuning, and bring its solution to a broader audience. 
</p><p class="paragraph" style="text-align:left;">[<a class="link" href="https://www.liquid.ai/blog/we-raised-250m-to-scale-capable-and-efficient-general-purpose-ai?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=ai-openai-google-ship-big-updates-world-labs-3d-spatial-demo-liquid-challenges-transformers-and-breaking-down-the-data-scarcity-hype-12-16-24" target="_blank" rel="noopener noreferrer nofollow">We raised $250M to scale capable and efficient general-purpose AI</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20Liquid%20AI%20secures%20%24250M%20to%20expand%20AI%20model%20infrastructure%20%20&body=%0A%0A%0ALiquid%20AI%20secures%20%24250M%20to%20expand%20AI%20model%20infrastructure%20%20%0AFirst%20impacted%3A%20%20%2F%2F%20Time%20to%20impact%3A%20%0A%0ALiquid%20AI%20has%20raised%20%24250%20million%20in%20a%20Series%20A%20round%20led%20by%20AMD%20Ventures%20to%20advance%20its%20Liquid%20Foundation%20Models%2C%20which%20are%20lightweight%2C%20general-purpose%20AI%20models.%20The%20company%20says%20it%20will%20use%20the%20funds%20to%20enhance%20its%20computing%20infrastructure%20and%20expedite%20product%20readiness%20for%20edge%20and%20on-premise%20applications%2C%20with%20AMD%27s%20Mathew%20Hein%20expressing%20enthusiasm%20about%20the%20collaboration.%0A%0A%20%5BWe%20raised%20%24250M%20to%20scale%20capable%20and%20efficient%20general-purpose%20AI%20https%3A%2F%2Fwww.liquid.ai%2Fblog%2Fwe-raised-250m-to-scale-capable-and-efficient-general-purpose-ai%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=56a838f4-fa73-483d-8a5a-18c705399c81&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
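A footnote on the Reinforcement Fine-Tuning item above: the core loop it describes (sample an output, score it with an evaluative reward signal, and nudge the policy toward higher-reward behavior) can be sketched in a few lines of Python. This is a toy illustration under our own assumptions, not OpenAI's actual RFT API: a weighted sampler stands in for the model, and the grader is a hard-coded dictionary.

```python
import random

# Toy stand-in for a model's policy: a weighted sampler over answer styles.
# In real reinforcement fine-tuning this would be an LLM. (Illustrative only.)
weights = {"concise": 1.0, "rambling": 1.0, "wrong": 1.0}

def sample(weights):
    """Draw one answer style with probability proportional to its weight."""
    total = sum(weights.values())
    r = random.uniform(0, total)
    for answer, w in weights.items():
        r -= w
        if r <= 0:
            return answer
    return answer  # numerical edge case: fall back to the last key

def grade(answer):
    """Evaluative signal (the 'grader'): reward good behavior, punish bad."""
    return {"concise": 1.0, "rambling": 0.2, "wrong": -1.0}[answer]

random.seed(0)
learning_rate = 0.1
for _ in range(500):
    answer = sample(weights)
    # Reinforcement update: scale the sampled option by its reward,
    # so high-reward behavior becomes more likely on the next draw.
    weights[answer] = max(1e-3, weights[answer] * (1 + learning_rate * grade(answer)))

best = max(weights, key=weights.get)  # the behavior the loop converged toward
```

The contrast with static fine-tuning is the point: a fixed dataset pins the model to whatever examples it was given, while this loop keeps reshaping behavior as the evaluative signal comes in.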
  ]]></content:encoded>
</item>

      <item>
  <title>Huge Week in AI: OpenAI launches &#39;Strawberry&#39; and Google&#39;s Real-Time AI Gaming engine (9.12.24)</title>
  <description>OpenAI, Google, DOOM, Magic, Cursor, Replit</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/585a0358-3867-472f-a1ae-d3f7268a4516/u6161822635_An_isometric_retro-futuristic_strawberry_robot_wi_dead3bca-1387-4fcb-a98b-eef6555cd171_1.png" length="1527582" type="image/png"/>
  <link>https://edit.headline.com/p/googles-realtime-ai-gaming-engine-openai-strawberry-details-9924</link>
  <guid isPermaLink="true">https://edit.headline.com/p/googles-realtime-ai-gaming-engine-openai-strawberry-details-9924</guid>
  <pubDate>Fri, 13 Sep 2024 00:00:12 +0000</pubDate>
  <atom:published>2024-09-13T00:00:12Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/7284f95e-59eb-450a-84d0-59be1bb001df/image.png?t=1726184862"/></div><p class="paragraph" style="text-align:left;">You may notice a slight change in this week’s edition of our newsletter. We are excited to announce the official relaunch of our newsletter under a new name: &quot;Headline Edit.&quot; And what an action-packed week to introduce it! Our mission remains the same: distill the week&#39;s biggest stories in AI —just with a fresh new look.</p><p class="paragraph" style="text-align:left;">This week, we shine a spotlight on a critical challenge in AI: reasoning as a major bottleneck for current large language models (LLMs). We delve into how recent advancements aim to overcome this hurdle, including OpenAI&#39;s latest efforts to enhance reasoning capabilities in their models. We also explore Magic&#39;s remarkable 100 million token context window, Google&#39;s GameNGen potentially redefining real-time gaming through cutting-edge AI, and impressive updates in the AI software engineering tooling space from Replit, Cursor, and GitHub&#39;s Copilot. With context windows now reaching unprecedented lengths and code generation tools becoming increasingly capable, these developments are converging to significantly accelerate software development and enhance AI&#39;s reasoning abilities across industries. Read on and strap in because it’s going to be an exciting couple of years ahead! 
</p><p class="paragraph" style="text-align:left;">— Sasha Krecinic</p><p class="paragraph" style="text-align:left;"></p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="open-ai-and-anthropic-announce-earl"><a class="link" href="https://openai.com/index/learning-to-reason-with-llms/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">OpenAI launches Strawberry aka ChatGPT-o1 and reveals even more…</a></h3><p class="paragraph" style="text-align:left;">OpenAI&#39;s new o1 series represents a massive breakthrough in AI capabilities, designed to handle complex, multi-step tasks in fields like science, math, and coding. What makes o1 stand out is its ability to &quot;think before it responds,&quot; using a chain-of-thought approach to solve problems more effectively than previous models. This is a major unlock for large language models (LLMs), which traditionally excel at generating text but struggle with deeper reasoning.</p><p class="paragraph" style="text-align:left;">In tests, o1 vastly outperforms both GPT-4o and human experts. For example, on the AIME (the American Invitational Mathematics Examination, a qualifier for the USA Math Olympiad), GPT-4o solved just 13% of problems, while o1 scored 83%, placing it among the top 500 students nationally. On PhD-level science benchmarks in physics, chemistry, and biology, o1 not only outperformed GPT-4o but also surpassed human PhD experts, becoming the first model to do so. 
In coding, o1 ranks in the 89th percentile in Codeforces challenges, far exceeding GPT-4o&#39;s 11th percentile performance.</p><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/ada7ed56-489a-4af8-bfd5-1f0ad78f7fd9/image.png?t=1726185167"/><div class="image__source"><span class="image__source_text"><p>Source: OpenAI - Learning to Reason with LLMs</p></span></div></div><p class="paragraph" style="text-align:left;"><b>Why Reasoning is a Big Unlock for LLMs</b></p><p class="paragraph" style="text-align:left;">Reasoning is the next frontier for LLMs, enabling them to tackle more complex, real-world problems that require logic, planning, and decision-making over time. While previous models like GPT-4o were great at generating coherent responses, they lacked the depth to break down intricate problems, reconsider approaches, and learn from mistakes in real-time. o1 changes this by leveraging reinforcement learning to refine its thinking process—similar to how humans problem-solve by rethinking steps and adapting strategies when things don’t work.</p><p class="paragraph" style="text-align:left;">This unlock means LLMs like o1 can be used in more sophisticated scenarios, from annotating genomic data in healthcare research to solving complicated quantum physics equations or optimizing complex code workflows. 
With reasoning, LLMs transition from being assistants that generate text to tools capable of deep analytical tasks, allowing them to rival human experts in specialized domains.</p><p class="paragraph" style="text-align:left;">o1-preview and the more efficient o1-mini are now available in ChatGPT, with plans for further updates to enhance these reasoning models’ functionality across broader applications.[<a class="link" href="https://openai.com/index/learning-to-reason-with-llms/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">OpenAI Launches O1 Model Series</a>] Share this story by email</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="google-launches-game-n-gen-a-ground"><a class="link" href="https://gamengen.github.io/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">Google launches GameNGen, a groundbreaking game engine using AI</a></h3><p class="paragraph" style="text-align:left;">In a significant development, Google has unveiled GameNGen, a groundbreaking game engine that leverages advanced neural models to recreate classic games like DOOM in real-time. This innovative technology allows for high-quality, interactive gameplay at over 20 frames per second on a single TPU. GameNGen&#39;s unique training process involves a reinforcement learning agent that learns to play the game, followed by a diffusion model that predicts the next frame based on previous actions. This could revolutionize gaming by seamlessly blending real and simulated experiences. We highly recommend checking out the demo which is almost indistinguishable from the original game. 
[<a class="link" href="https://gamengen.github.io/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">GameNGen</a>]</p><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/50cd7f07-397c-4471-93ec-72c2c8eb235f/image.png?t=1726184939"/></div><hr class="content_break"><h3 class="heading" style="text-align:left;" id="magic-ltm-2-mini-launches-with-unpr"><a class="link" href="https://magic.dev/blog/100m-token-context-windows?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">Magic LTM-2-Mini launches with unprecedented 100 million token capacity</a></h3><p class="paragraph" style="text-align:left;">In a significant advancement, Magic&#39;s LTM-2-Mini model now supports a remarkable 100 million tokens, equivalent to 10 million lines of code or 750 novels. CEO Eric Steinberger also announced a $320 million funding round aimed at realizing their vision of autonomous AI while acknowledging the considerable challenges ahead.</p><p class="paragraph" style="text-align:left;">The best example of why this is relevant is arguably in software development. Imagine a developer working with a massive codebase that includes millions of lines of code, along with various libraries and documentation. A traditional AI coding assistant might struggle to keep track of all this information, leading to mistakes, poor troubleshooting, or incomplete solutions. However, Magic&#39;s AI, with its ability to handle ultra-long contexts, can keep all of this information in mind at once. 
This means it can accurately suggest code improvements, debug complex issues, or even generate new code by understanding the entire context. For instance, if the AI needs to fix a bug, it can consider the entire codebase, identify the exact spot where the issue occurs, and suggest a precise fix, all without losing track of other important details. This makes the AI a powerful tool for developers working on large, complex projects. [<a class="link" href="https://magic.dev/blog/100m-token-context-windows?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">100M Token Context Windows</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="study-shows-that-generative-ai-tool"><a class="link" href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4945566&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">Study shows that generative AI tools boost software developer productivity by over 26%</a></h3><p class="paragraph" style="text-align:left;">A recent study indicates that GitHub Copilot can boost software developer productivity by over 26%. However, a significant portion of developers, 30 to 40 percent, have yet to try the AI tool, underscoring the need to consider individual preferences in workplace technology adoption. The productivity gains are most pronounced among less experienced developers, who achieve a remarkable 39% increase in output, while their more seasoned counterparts see only marginal improvements. 
[<a class="link" href="https://ssrn.com?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">ssrn.com</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="replit-launches-replit-agent-to-sim"><a class="link" href="https://twitter.com/amasad/status/1831730911685308857?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">Replit launches Replit Agent to simplify software development tasks</a></h3><p class="paragraph" style="text-align:left;">Replit has launched Replit Agent in early access, which aims to automate software development. Users are praising its ability to simplify the setup and deployment of applications, potentially transforming how software is created and launched. The Replit Agent not only streamlines the coding process but also integrates fully featured development and production environments, allowing users to build and deploy applications seamlessly from a single platform. Early testers are already expressing excitement about the potential of Replit Agent to enable the creation of entire web apps directly from mobile devices, marking a significant leap in accessibility for developers on the go. 
[<a class="link" href="https://twitter.com/amasad/status/1831730911685308857?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">via @amasad</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="cursor-also-accelerates-app-develop"><a class="link" href="https://twitter.com/mckaywrigley/status/1831429674582602198?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">Cursor also accelerates app development, replicating complex software in minutes</a></h3><p class="paragraph" style="text-align:left;">The main difference between Replit and Cursor is that Replit is a cloud-based, collaborative coding environment, ideal for quick prototyping and small projects without the need for local setup. Cursor, on the other hand, is a local code editor designed for more complex, long-term projects that require scalability, better performance, and deeper integration with your machine&#39;s development environment. One Cursor user created a Perplexity clone in just eight minutes and fewer than 14 interactions. This innovation excites users, as features that previously required significant resources to develop can now potentially be completed in a fraction of the time. 
[<a class="link" href="https://twitter.com/mckaywrigley/status/1831429674582602198?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=huge-week-in-ai-openai-launches-strawberry-and-google-s-real-time-ai-gaming-engine-9-12-24" target="_blank" rel="noopener noreferrer nofollow">via @mckaywrigley</a>]</p><p class="paragraph" style="text-align:left;">__</p><p class="paragraph" style="text-align:left;"><span style="color:rgb(29, 28, 29);font-family:Slack-Lato, Slack-Fractions, appleLogo, sans-serif;font-size:15px;">The views and opinions expressed in this newsletter are those of the individual authors and do not necessarily reflect the official policy or position of Headline. All content is intended for informational purposes only and should not be construed as professional advice. Headline disclaims any responsibility for the accuracy, completeness, or reliability of the information presented.</span></p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=82cbc6c2-33ba-4f54-814e-c3fbf9d21b11&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

      <item>
  <title>The AI Scientist, Another Step Closer to AGI?</title>
  <description>Sakana AI, Grok, OpenAI, Anthropic</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/fdc8f28f-16d7-4185-a98a-0eed5cee0831/An_isometric_retrofu.jpg" length="136588" type="image/jpeg"/>
  <link>https://edit.headline.com/p/ai-ai-scientist-another-step-closer-agi-81624</link>
  <guid isPermaLink="true">https://edit.headline.com/p/ai-ai-scientist-another-step-closer-agi-81624</guid>
  <pubDate>Fri, 16 Aug 2024 21:25:11 +0000</pubDate>
  <atom:published>2024-08-16T21:25:11Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><p class="paragraph" style="text-align:left;"></p><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/fdc8f28f-16d7-4185-a98a-0eed5cee0831/An_isometric_retrofu.jpg?t=1724172562"/></div><p class="paragraph" style="text-align:left;">The research community is closely monitoring the advancements needed to unlock AGI (Artificial General Intelligence). Today, we may be one step closer with the latest research from <a class="link" href="https://Sakana.ai?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">Sakana.ai</a>, which outlines how they have automated the scientific research lifecycle. Additionally, we see notable progress in autonomous software engineering AI agents, demonstrated by the team at Cosine. Meanwhile, the latest chatbot rankings highlight a highly competitive landscape in terms of speed, price, and capability.</p><p class="paragraph" style="text-align:left;">We’ve also launched a podcast covering some of the breaking news stories. If you like to consume content in video format, subscribe to our YouTube channel! 
<a class="link" href="https://www.youtube.com/watch?v=2nd_k4Lj080&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">Check out this week’s episode</a> where we cover OpenAI Strawberry, Google’s Pingpong robot, and the pricing cuts from OpenAI and Google.</p><p class="paragraph" style="text-align:left;">— Sasha Krecinic</p><div class="image"><a class="image__link" href="https://www.youtube.com/watch?v=2nd_k4Lj080&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" rel="noopener" target="_blank"><img alt="" class="image__image" style="" src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXcJqntbkee7akNuBlTc0ylwq1Xszk66pF00hqmwErkcBQ8rlCGRBLRh61TrNuIU9TB_PNoR_Xo2QatiOJiQIln6W92FyPQkS7A-bB-qMf0wqcht_e5bh02hoLLjifxRjsNYFNgBLUmquUiY8ZnOecPRkag?key=rEkcOMhtlAqm3SJL2aKClw"/></a></div><hr class="content_break"><h3 class="heading" style="text-align:left;" id="sakana-ai-launches-the-ai-scientist"><a class="link" href="https://arxiv.org/abs/2408.06292?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">Sakana AI launches The AI Scientist for automated scientific research</a></h3><p class="paragraph" style="text-align:left;">Sakana AI has launched The AI Scientist, an AI system that automates scientific research and claims it can manage the entire machine learning research lifecycle. Developed with researchers from the University of Oxford and the University of British Columbia, the system has produced papers in areas such as language modeling and diffusion and is open-sourced. 
<span style="color:rgba(0, 0, 0, 0.9);font-family:-apple-system, system-ui, system-ui, Segoe UI, Roboto, Helvetica Neue, Fira Sans, Ubuntu, Oxygen, Oxygen Sans, Cantarell, Droid Sans, Apple Color Emoji, Segoe UI Emoji, Segoe UI Emoji, Segoe UI Symbol, Lucida Grande, Helvetica, Arial, sans-serif;font-size:16px;">Many in the research community see this as a potential flywheel for AI systems to unlock self-improvement.</span> [<a class="link" href="https://arxiv.org/abs/2408.06292?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="cosine-ai-claims-to-have-built-the-"><a class="link" href="https://twitter.com/AlistairPullen/status/1822981361608888619?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">Cosine AI claims to have built the most capable AI software engineer</a></h3><p class="paragraph" style="text-align:left;">Cosine AI says it has built an AI software engineer that scored 30.08% on the SWE-Bench benchmark, surpassing Amazon and Cognition. The model is designed to mimic human software engineering behavior and claims to perform at 50% on the easier SWE-Bench Lite benchmark. 
[<a class="link" href="https://twitter.com/AlistairPullen/status/1822981361608888619?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">via @AlistairPullen</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="gpt-4-o-reclaims-top-spot-and-grok-"><a class="link" href="http://leaderboard.lmsys.org/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">GPT-4o reclaims top spot and Grok 2 ranks #3 on Chatbot Arena leaderboard</a></h3><p class="paragraph" style="text-align:left;">OpenAI&#39;s ChatGPT-4o has taken the lead in the Chatbot Arena with a score of 1314, surpassing Google&#39;s Gemini-1.5-Pro-Exp after over 11,000 community votes under the masked title of &quot;anonymous chatbot&quot;. The model ranks highly in Math, Coding, and Instruction-Following, with a notable improvement in coding, scoring over 30 points higher than its predecessor. Meanwhile, Grok 2, a recent addition, reached third place overall, ranking #2 in Coding, #2 in Math, and #4 in Hard Prompts. Super impressive results from the newcomer! 
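Arena scores are Elo ratings, so the gap between two models' scores maps directly to an expected head-to-head win rate. A minimal sketch of that conversion (the 1286 rating below is illustrative):

```python
def elo_win_probability(rating_a, rating_b):
    """Expected head-to-head win rate of A over B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))

# A 28-point gap is only about a 54% expected win rate, so "taking the
# lead" on the leaderboard means winning narrowly, but consistently.
print(round(elo_win_probability(1314, 1286), 3))  # 0.54
```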
[<a class="link" href="https://lmsys.org?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">lmsys.org</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="anthropic-unveils-prompt-caching-90"><a class="link" href="https://www.anthropic.com/news/prompt-caching?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">Anthropic Unveils Prompt Caching: 90% Cost Savings, 85% Faster AI Responses</a></h3><p class="paragraph" style="text-align:left;">Anthropic has introduced a prompt caching feature for its Claude AI models, now available in public beta, which allows developers to cache frequently used contexts between API calls. This feature reduces costs by up to 90% and improves latency by up to 85%, making it highly efficient for use cases like conversational agents, coding assistants, and large document processing. The pricing model involves a slightly higher cost for caching inputs but offers significantly cheaper access to cached content, with early adopters like Notion already seeing substantial benefits. 
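Mechanically, caching is opt-in per content block: you mark a large, stable prefix (such as a long system context) with a `cache_control` field, and repeat requests reuse it at the discounted cache-read rate. A sketch of the request shape based on the beta as announced (model name and prompt contents are illustrative; at launch the feature also required an `anthropic-beta: prompt-caching-2024-07-31` header):

```python
LONG_DOCUMENT = "<full text of a large, rarely changing document>"

# The large shared prefix is marked with cache_control so follow-up
# requests hit the cache instead of paying the full input-token price.
request = {
    "model": "claude-3-5-sonnet-20240620",
    "max_tokens": 512,
    "system": [
        {"type": "text", "text": "Answer questions about the document."},
        {
            "type": "text",
            "text": LONG_DOCUMENT,
            "cache_control": {"type": "ephemeral"},
        },
    ],
    "messages": [{"role": "user", "content": "Summarize section 2."}],
}
# Passed as client.messages.create(**request) with the Anthropic SDK;
# only the trailing user message changes between calls.
```

The savings come precisely from this split between a stable prefix and a short varying suffix, which is why chat agents and document Q&A benefit most.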
[<a class="link" href="https://www.anthropic.com/news/prompt-caching?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">Prompt caching with Claude</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="gemini-makes-your-mobile-device-a-p"><a class="link" href="https://dpmd.ai/46RToL9?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">Gemini Makes Your Mobile Device a Powerful AI Assistant</a></h3><p class="paragraph" style="text-align:left;">Google DeepMind has introduced &quot;Gemini Live,&quot; a new feature that enhances conversational AI interactions on Android devices. Available to Gemini Advanced subscribers, Gemini Live allows for more natural conversations with the ability to brainstorm ideas, interrupt to ask questions, and pause chats to resume later. This feature is now rolling out in English, making mobile AI interactions smoother and more responsive, with significant potential for accessibility and productivity improvements. [<a class="link" href="https://dpmd.ai/46RToL9?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=the-ai-scientist-another-step-closer-to-agi" target="_blank" rel="noopener noreferrer nofollow">Gemini makes your mobile device a powerful AI assistant</a>]</p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=55484d94-47f8-41f8-b2ff-dd23e105e324&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

      <item>
  <title>Gemini Leads LLM Leaderboard, Character.AI Founders Return to Google, Groq Series D and More (8.8.24)</title>
  <description>Google, Gemini, Character.ai, Groq, AI Safety, Argentina</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/a939baee-594c-4d57-8d4d-f2a6d74d94d6/An_isometric_retrofu__1_.jpg" length="190909" type="image/jpeg"/>
  <link>https://edit.headline.com/p/ai-gemini-leads-llm-leaderboard-characterai-founders-return-google-groq-series-d-8524</link>
  <guid isPermaLink="true">https://edit.headline.com/p/ai-gemini-leads-llm-leaderboard-characterai-founders-return-google-groq-series-d-8524</guid>
  <pubDate>Fri, 09 Aug 2024 00:54:17 +0000</pubDate>
  <atom:published>2024-08-09T00:54:17Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><p class="paragraph" style="text-align:left;"></p><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/a939baee-594c-4d57-8d4d-f2a6d74d94d6/An_isometric_retrofu__1_.jpg?t=1724172898"/></div><p class="paragraph" style="text-align:left;">Google Gemini 1.5 Pro is making waves as it outperforms GPT-4o and Claude-3.5 in the LMSYS Chatbot Arena. Many predicted it was only a matter of time before Google caught up, and it&#39;s the first time we&#39;ve seen them take the lead, somewhat reversing sentiments that Google was getting &#39;left behind&#39;. We also see hints of a broader strategy towards higher personalization with Google&#39;s partnership with <a class="link" href="https://Character.AI?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Character.AI</a>, known for its expertise in creating advanced conversational models and personalized AI interactions. In other news, Groq has announced new funding, and Argentina is pursuing AI applications reminiscent of sci-fi, drawing comparisons to the 2002 movie Minority Report.</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="google-gemini-15-pro-tops-chatbot-a"><a class="link" href="https://chat.lmsys.org/?leaderboard=&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Google Gemini 1.5 Pro tops Chatbot Arena</a></h3><p class="paragraph" style="text-align:left;">Google&#39;s Gemini 1.5 Pro (Experimental Version 0801) has claimed the top spot in Chatbot Arena, surpassing GPT-4o and Claude-3.5 with a score of 1300. 
The score of 1300 for Google&#39;s Gemini 1.5 Pro in Chatbot Arena reflects its Elo rating, a ranking system originally developed for chess, and indicates a win percentage of 54% against GPT-4o and 59% against Claude-3.5 Sonnet, showcasing its superior performance in head-to-head comparisons. The model excels in multilingual tasks and technical areas like Math and Instruction-Following, though it trails Claude 3.5 Sonnet and GPT-4o in domains like Coding and Hard Prompts. Google Cloud says this experimental version is now available for early testing and feedback in Google AI Studio and the Gemini API. [<a class="link" href="https://chat.lmsys.org/?leaderboard=&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive score of 1300 (!), and also achieving #1 on our Vision Leaderboard.</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="character-ai-gets-funding-from-goog"><a class="link" href="https://Character.AI?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Character.AI</a><a class="link" href="https://n-4ycu2pnnxihsfrjsogoakm2f2q5acpq5n7eoqrq-1lu-script.googleusercontent.com/userCodeAppPanel?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow"> Gets Funding from Google and Co-Founders Return to Google</a></h3><p class="paragraph" style="text-align:left;">Noam Shazeer and Daniel De Freitas, co-founders of
<a class="link" href="https://Character.AI?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Character.AI</a>, are returning to Google, with Shazeer joining the DeepMind research team, while <a class="link" href="https://Character.AI?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Character.AI</a>’s general counsel Dominic Perella will serve as interim CEO. <a class="link" href="https://Character.ai?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Character.ai</a> is a platform where users can create and interact with AI-driven virtual characters that adapt to individual inputs, tailoring their responses and behaviors for personalized, engaging interactions across various applications. According to a TechCrunch interview, Google has signed a non-exclusive agreement to use <a class="link" href="https://Character.AI?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Character.AI</a>’s technology, which Shazeer says will provide funding for <a class="link" href="https://Character.AI?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Character.AI</a>’s continued growth and focus on building personalized AI products. 
[<a class="link" href="https://n-4ycu2pnnxihsfrjsogoakm2f2q5acpq5n7eoqrq-1lu-script.googleusercontent.com/userCodeAppPanel?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Exclusive: </a><a class="link" href="https://Character.AI?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Character.AI</a><a class="link" href="https://n-4ycu2pnnxihsfrjsogoakm2f2q5acpq5n7eoqrq-1lu-script.googleusercontent.com/userCodeAppPanel?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow"> CEO Noam Shazeer returns to Google as the tech giant invests in the AI company</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="groq-raises-640-million-valuation-h"><a class="link" href="https://www.reuters.com/technology/artificial-intelligence/ai-chip-startup-groq-valued-28-bln-after-latest-funding-round-2024-08-05/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Groq raises $640 million, valuation hits $2.8 billion</a></h3><p class="paragraph" style="text-align:left;">Groq has raised $640 million in Series D funding, led by Cisco Investments, Samsung Catalyst Fund, and BlackRock Private Equity Partners, bringing its valuation to $2.8 billion. Groq&#39;s chips are designed with a unique architecture called the Tensor Streaming Processor (TSP). 
This architecture processes data in a highly parallel manner, allowing it to handle multiple tasks simultaneously with the benefit of both speed and efficiency. Unlike traditional GPUs that are designed for a wide range of tasks, Groq&#39;s TSP is specifically optimized for AI workloads, making it exceptionally fast and energy-efficient for inference and tasks like running large language models. The company says its chips, which now run Meta Platforms&#39; LLaMA, are four times faster, five times cheaper, and three times more energy-efficient than Nvidia&#39;s GPUs for AI inference tasks. [<a class="link" href="https://www.reuters.com/technology/artificial-intelligence/ai-chip-startup-groq-valued-28-bln-after-latest-funding-round-2024-08-05/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">AI chip startup Groq valued at $2.8 bln after latest funding round</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="argentinas-president-launches-ai-se"><a class="link" href="https://www.theguardian.com/world/article/2024/aug/01/argentina-ai-predicting-future-crimes-citizen-rights?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Argentina&#39;s President Launches AI Security Unit</a></h3><p class="paragraph" style="text-align:left;">Argentina&#39;s President Javier Milei has launched the AI Applied to Security Unit, raising concerns about potential human rights violations. The Ministry of Security says the unit will use AI to predict crimes and monitor social media, while human rights groups fear it could lead to over-surveillance and profiling. 
[<a class="link" href="https://www.theguardian.com/world/article/2024/aug/01/argentina-ai-predicting-future-crimes-citizen-rights?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=gemini-leads-llm-leaderboard-character-ai-founders-return-to-google-groq-series-d-and-more-8-8-24" target="_blank" rel="noopener noreferrer nofollow">Argentina will use AI to ‘predict future crimes’ but experts worry for citizens’ rights</a>]</p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=cde0144d-9f8c-4b58-bd8e-4d8c5d7fcf4f&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

      <item>
  <title>OpenAI launches Search, Meta&#39;s ‘Segment Anything Model’, and Llama 3.1 Climbs the Leaderboard (7.31.24)</title>
  <description>OpenAI, Meta, Search, SAM</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/dd429db8-aa10-4f56-a838-d63cfab69749/An_isometric_retrofu.jpg" length="96137" type="image/jpeg"/>
  <link>https://edit.headline.com/p/ai-openai-launches-search-metas-segment-anything-model-llama-31-climbs-leaderboard-73024</link>
  <guid isPermaLink="true">https://edit.headline.com/p/ai-openai-launches-search-metas-segment-anything-model-llama-31-climbs-leaderboard-73024</guid>
  <pubDate>Thu, 01 Aug 2024 00:40:00 +0000</pubDate>
  <atom:published>2024-08-01T00:40:00Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/dd429db8-aa10-4f56-a838-d63cfab69749/An_isometric_retrofu.jpg?t=1724181654"/></div><p class="paragraph" style="text-align:left;">This week, we saw two new product launches that advance search and video processing capabilities. OpenAI&#39;s Search product has finally been released in closed beta after many months of rumors, and Meta&#39;s SAM 2 (Segment Anything Model 2) promises to redefine real-time object segmentation and tracking in images and videos. </p><p class="paragraph" style="text-align:left;">– Sasha Krecinic</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="open-ai-launches-search-gpt-prototy"><a class="link" href="https://openai.com/index/searchgpt-prototype/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-launches-search-meta-s-segment-anything-model-and-llama-3-1-climbs-the-leaderboard-7-31-24" target="_blank" rel="noopener noreferrer nofollow">OpenAI launches SearchGPT prototype</a></h3><p class="paragraph" style="text-align:left;">OpenAI is testing a new prototype called SearchGPT, designed to provide fast and timely answers with clear and relevant sources. According to a blog post, the prototype is being launched with a small group of users for feedback, with plans to integrate it into ChatGPT to enhance real-time search capabilities. The launch comes just a few weeks after OpenAI&#39;s acquisition of Rockset, and follows weeks of swirling rumors that a search product was on the way. 
A notable OpenAI employee, Noam Brown, also commented that this is &quot;another step toward general AI personal assistants for all.&quot; [<a class="link" href="https://openai.com?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-launches-search-meta-s-segment-anything-model-and-llama-3-1-climbs-the-leaderboard-7-31-24" target="_blank" rel="noopener noreferrer nofollow">openai.com</a>] Share this story by email</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="metas-llama-31405-b-ranks-third-in-"><a class="link" href="http://leaderboard.lmsys.org/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-launches-search-meta-s-segment-anything-model-and-llama-3-1-climbs-the-leaderboard-7-31-24" target="_blank" rel="noopener noreferrer nofollow">Meta&#39;s Llama-3.1-405B Ranks Third in AI Leaderboard</a></h3><p class="paragraph" style="text-align:left;">Meta&#39;s Llama-3.1-405B has reached #3 on the Overall Arena leaderboard, marking the first time an open model has made the top 3. The model was tested over the past week, receiving over 10K community votes. 
[<a class="link" href="https://lmsys.org?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-launches-search-meta-s-segment-anything-model-and-llama-3-1-climbs-the-leaderboard-7-31-24" target="_blank" rel="noopener noreferrer nofollow">lmsys.org</a>] Share this story by email</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="meta-launches-sam-2-for-realtime-ob"><a class="link" href="https://go.fb.me/yck7bu?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-launches-search-meta-s-segment-anything-model-and-llama-3-1-climbs-the-leaderboard-7-31-24" target="_blank" rel="noopener noreferrer nofollow">Meta launches SAM 2 for real-time object segmentation</a></h3><p class="paragraph" style="text-align:left;">Meta has launched the Segment Anything Model 2 (SAM 2), a groundbreaking unified model for real-time, promptable object segmentation in both images and videos. Following the success of SAM, SAM 2 offers state-of-the-art performance and introduces a novel &quot;memory attention&quot; feature that uses a transformer with memory across frames. It stores special &quot;object pointer&quot; tokens in a &quot;memory bank&quot; FIFO queue of recent and prompted frames. SAM 2 can segment any object in any video or image, even those it has not seen before, enabling a diverse range of use cases without custom adaptation. It is open source under the Apache 2.0 license and includes the SA-V dataset, containing approximately 51,000 real-world videos and more than 600,000 masklets. Meta provides a web demo, research paper, and datasets that are worth checking out! 
[<a class="link" href="https://fb.me?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-launches-search-meta-s-segment-anything-model-and-llama-3-1-climbs-the-leaderboard-7-31-24" target="_blank" rel="noopener noreferrer nofollow">fb.me</a>] Share this story by email</p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=368225ac-7b5b-41e6-85e5-085a793397a4&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

      <item>
  <title>A Big Week for AI: Meta&#39;s New SOTA Model, UBI Study, GPT-4o Mini + Free Finetuning, and Voice Standards</title>
  <description>OpenAI, Meta, Daily, UBI, Sam Altman</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/97321f69-9dc4-4be6-a22a-faea0db5ff8c/An_isometric_retrofu__1_.jpg" length="123122" type="image/jpeg"/>
  <link>https://edit.headline.com/p/new-postai-big-week-ai-metas-new-sota-model-ubi-study-gpt4o-mini-free-finetuning-voice-standards-726</link>
  <guid isPermaLink="true">https://edit.headline.com/p/new-postai-big-week-ai-metas-new-sota-model-ubi-study-gpt4o-mini-free-finetuning-voice-standards-726</guid>
  <pubDate>Fri, 26 Jul 2024 19:15:00 +0000</pubDate>
  <atom:published>2024-07-26T19:15:00Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><h1 class="heading" style="text-align:start;" id="heading-1"></h1><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/97321f69-9dc4-4be6-a22a-faea0db5ff8c/An_isometric_retrofu__1_.jpg?t=1724181799"/></div><p class="paragraph" style="text-align:start;">It has been another big week in AI. Most notably, there is a paradigm shift in the competition between open and closed-source models. Meta&#39;s latest release, Llama 3.1 405B, sets a new bar for open-source AI performance with enhanced reasoning and multimodal capabilities. The release is state-of-the-art for the open-source community, radically improving the toolkits available to AI startups and developers. But that&#39;s not all: today&#39;s edition also covers the broader implications of advancing AI, with the release of results from the Sam Altman-backed UBI (universal basic income) study. OpenAI has introduced GPT-4o mini, which is reportedly smarter and 60% cheaper than GPT-3.5 Turbo, and has also launched free fine-tuning for GPT-4o mini until September. We also saw Daily release an open standard for Real-time Voice and Video Inference (RTVI-AI). These developments are significant because they make cutting-edge AI technology more accessible and affordable, while the UBI results offer an early look at the social implications. Based on today’s updates, it isn&#39;t crazy to imagine a world where your next dentist&#39;s appointment is booked by speaking to an AI agent. 
People could work less and potentially have a steady stream of money coming in each month.</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="meta-launches-sota-open-source-mode"><a class="link" href="https://go.fb.me/vq04tr?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-big-week-for-ai-meta-s-new-sota-model-ubi-study-gpt-4o-mini-free-finetuning-and-voice-standards" target="_blank" rel="noopener noreferrer nofollow">Meta launches SOTA Open Source Model</a></h3><p class="paragraph" style="text-align:start;">Meta has introduced Llama 3.1, a new set of foundation models designed to rival leading closed-source models in various tasks. These models, including a 405 billion parameter version, boast enhanced reasoning capabilities and a larger 128,000-token context window, along with multimodal features for image and video processing. Key to the model&#39;s size and performance are its improved data quality and scale, with training conducted on a diverse and high-quality dataset of 15 trillion multilingual tokens. Meta has made these models publicly available, including both pre-trained and post-trained versions, to foster innovation in the research community and promote the responsible development of artificial general intelligence (AGI). 
[<a class="link" href="https://fb.me?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-big-week-for-ai-meta-s-new-sota-model-ubi-study-gpt-4o-mini-free-finetuning-and-voice-standards" target="_blank" rel="noopener noreferrer nofollow">fb.me</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20Meta%20launches%20SOTA%20Open%20Source%20Model&body=%0A%0A%0AMeta%20launches%20SOTA%20Open%20Source%20Model%0AFirst%20impacted%3A%20AI%20researchers%2C%20Data%20scientists%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AMeta%20has%20introduced%20Llama%203.1%2C%20a%20new%20set%20of%20foundation%20models%20designed%20to%20rival%20leading%20closed-source%20models%20in%20various%20tasks.%20These%20models%2C%20including%20a%20405%20billion%20parameter%20version%2C%20boast%20enhanced%20reasoning%20capabilities%20and%20a%20larger%20128%2C000-token%20context%20window%2C%20along%20with%20multimodal%20features%20for%20image%20and%20video%20processing.%20Key%20to%20the%20model%27s%20size%20and%20performance%20are%20its%20improved%20data%20quality%20and%20scale%2C%20with%20training%20conducted%20on%20a%20diverse%20and%20high-quality%20dataset%20of%2015%20trillion%20multilingual%20tokens.%20Meta%20has%20made%20these%20models%20publicly%20available%2C%20including%20both%20pre-trained%20and%20post-trained%20versions%2C%20to%20foster%20innovation%20in%20the%20research%20community%20and%20promote%20the%20responsible%20development%20of%20artificial%20general%20intelligence%20(AGI).%0A%0A%20%5Bfb.me%20https%3A%2F%2Fgo.fb.me%2Fvq04tr%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="open-ai-launches-gpt-4-o-mini-with-"><a class="link" href="https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-big-week-for-ai-meta-s-new-sota-model-ubi-study-gpt-4o-mini-free-finetuning-and-voice-standards" 
target="_blank" rel="noopener noreferrer nofollow">OpenAI Launches GPT-4o Mini with Free Training Tokens</a></h3><p class="paragraph" style="text-align:start;">OpenAI has launched GPT-4o mini, which it says is smarter and 60% cheaper than GPT-3.5 Turbo. GPT-4o mini excels in reasoning, math, coding, and multimodal tasks, outperforming GPT-3.5 Turbo and other small models on several key benchmarks. OpenAI also announced free fine-tuning for the model, with the first 2 million training tokens per day free until September 23! These smaller, cheaper models are important for highly repetitive tasks that don’t need a larger or more expensive model (and because they use significantly less power, they are much better for the wallet and the planet!) [<a class="link" href="https://openai.com?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-big-week-for-ai-meta-s-new-sota-model-ubi-study-gpt-4o-mini-free-finetuning-and-voice-standards" target="_blank" rel="noopener noreferrer nofollow">openai.com</a>] Share this story <a class="link" 
href="mailto:?subject=%20.%20.%20OpenAI%20Launches%20GPT-4o%20mini%20with%20Free%20Training%20Tokens&body=%0A%0A%0AOpenAI%20Launches%20GPT-4o%20mini%20with%20Free%20Training%20Tokens%0AFirst%20impacted%3A%20AI%20developers%2C%20tech%20startups%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AOpenAI%20has%20launched%20GPT-4o%20mini%2C%20which%20they%20say%20is%20smarter%20and%2060%25%20cheaper%20than%20GPT-3.5%20Turbo.%20OpenAI%20has%20also%20launched%20fine-tuning%20for%20GPT-4o%20mini.%20GPT-4o%20mini%20excels%20in%20reasoning%2C%20math%2C%20coding%2C%20and%20multimodal%20tasks%2C%20outperforming%20GPT-3.5%20Turbo%20and%20other%20small%20models%20on%20several%20key%20benchmarks.%20OpenAI%20also%20mentioned%20in%20another%20release%20that%20they%20will%20be%20offering%20free%20fine-tuning%20for%20the%20model%20with%20the%20first%202%20million%20training%20tokens%20per%20day%20are%20free%20until%20September%2023!%20These%20super%20small%20and%20cheaper%20models%20are%20important%20for%20highly%20repetitive%20tasks%20that%20don%E2%80%99t%20need%20a%20larger%20or%20more%20expensive%20model%20(and%20because%20they%20use%20significantly%20less%20power%2C%20they%20are%20much%20better%20for%20the%20the%20wallet%20and%20the%20planet!)%0A%0A%20%5Bopenai.com%20https%3A%2F%2Fopenai.com%2Findex%2Fgpt-4o-mini-advancing-cost-efficient-intelligence%2F%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="daily-launches-open-standard-for-re"><a class="link" href="https://demo.rtvi.ai/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-big-week-for-ai-meta-s-new-sota-model-ubi-study-gpt-4o-mini-free-finetuning-and-voice-standards" target="_blank" rel="noopener noreferrer nofollow">Daily Launches Open Standard for Real-time Voice and Video AI</a></h3><p class="paragraph" style="text-align:start;">Daily has launched an open standard for Real-time Voice and Video Inference (RTVI-AI) 
along with open-source JavaScript and React SDKs, with iOS and Android SDKs coming soon. According to the release, RTVI-AI defines how client applications communicate with inference services, enabling use cases like voice chat with LLMs, enterprise voice workflows, video avatars, voice-driven user interfaces, and high-framerate image generation. The demo leverages Llama 3.1 running on @GroqInc and has impressive 500ms voice-to-voice response times (which is comparable to real-life conversations!) and shows how far the frontier of tech for live voice agents has come in a short time. [<a class="link" href="https://github.com?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-big-week-for-ai-meta-s-new-sota-model-ubi-study-gpt-4o-mini-free-finetuning-and-voice-standards" target="_blank" rel="noopener noreferrer nofollow">github.com</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20Daily%20Launches%20Open%20Standard%20for%20Real-time%20Voice%20and%20Video%20AI%20&body=%0A%0A%0ADaily%20Launches%20Open%20Standard%20for%20Real-time%20Voice%20and%20Video%20AI%20%0AFirst%20impacted%3A%20Developers%2C%20AI%20researchers%20%2F%2F%20Time%20to%20impact%3A%20%0A%0ADaily%20has%20launched%20an%20open%20standard%20for%20Real-time%20Voice%20and%20Video%20Inference%20(RTVI-AI)%20along%20with%20open-source%20JavaScript%20and%20React%20SDKs%2C%20with%20iOS%20and%20Android%20SDKs%20coming%20soon.%20According%20to%20the%20release%2C%20RTVI-AI%20defines%20how%20client%20applications%20communicate%20with%20inference%20services%2C%20enabling%20use%20cases%20like%20voice%20chat%20with%20LLMs%2C%20enterprise%20voice%20workflows%2C%20video%20avatars%2C%20voice-driven%20user%20interfaces%2C%20and%20high-framerate%20image%20generation.%20The%20demo%20leverages%20Llama%203.1%20running%20on%20%40GroqInc%20and%20has%20impressive%20500ms%20voice-to-voice%20response%20times%20(which%20is%20comparable%20to%20real-life%20conversations!)%20and%20shows%20how%20far%20the
%20frontier%20of%20tech%20for%20live%20voice%20agents%20has%20come%20in%20a%20short%20time.%20%0A%0A%20%5Bgithub.com%20https%3A%2F%2Fdemo.rtvi.ai%2F%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="ubi-study-findings-released"><a class="link" href="https://www.openresearchlab.org/findings?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-big-week-for-ai-meta-s-new-sota-model-ubi-study-gpt-4o-mini-free-finetuning-and-voice-standards" target="_blank" rel="noopener noreferrer nofollow">UBI Study </a><a class="link" href="https://www.openresearchlab.org/findings?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-big-week-for-ai-meta-s-new-sota-model-ubi-study-gpt-4o-mini-free-finetuning-and-voice-standards" target="_blank" rel="noopener noreferrer nofollow">Findings Released</a></h3><p class="paragraph" style="text-align:start;">A study by OpenResearch with backing from Sam Altman examined the effects of giving $1,000 per month to low-income individuals. The study explored the impact on spending, agency, employment, health, and moving. The summary findings stated that: &quot;The program resulted in a 2.0 percentage point decrease in labor market participation for participants and a 1.3-1.4 hour per week reduction in labor hours, with participants’ partners reducing their hours worked by a comparable amount. The transfer generated the largest increases in time spent on leisure, as well as smaller increases in time spent in other activities such as transportation and finances. Despite asking detailed questions about amenities, we find no impact on quality of employment, and our confidence intervals can rule out even small improvements. We observe no significant effects on investments in human capital, though younger participants may pursue more formal education. 
Overall, our results suggest a moderate labor supply effect that does not appear offset by other productive activities.&quot; [<a class="link" href="https://openresearchlab.org?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-big-week-for-ai-meta-s-new-sota-model-ubi-study-gpt-4o-mini-free-finetuning-and-voice-standards" target="_blank" rel="noopener noreferrer nofollow">openresearchlab.org</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20UBI%20Study%20Finds%20Reduced%20Labor%20Participation&body=%0A%0A%0AUBI%20Study%20Finds%20Reduced%20Labor%20Participation%0AFirst%20impacted%3A%20Policy%20makers%2C%20social%20scientists%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AA%20study%20by%20OpenResearch%20with%20backing%20from%20Sam%20Altman%20examined%20the%20effects%20of%20giving%20%241%2C000%20per%20month%20to%20low-income%20individuals.%20The%20study%20explored%20the%20impact%20on%20spending%2C%20agency%2C%20employment%2C%20health%2C%20and%20moving.%20The%20summary%20findings%20stated%20that%3A%20%22The%20program%20resulted%20in%20a%202.0%20percentage%20point%20decrease%20in%20labor%20market%20participation%20for%20participants%20and%20a%201.3-1.4%20hour%20per%20week%20reduction%20in%20labor%20hours%2C%20with%20participants%E2%80%99%20partners%20reducing%20their%20hours%20worked%20by%20a%20comparable%20amount.%20The%20transfer%20generated%20the%20largest%20increases%20in%20time%20spent%20on%20leisure%2C%20as%20well%20as%20smaller%20increases%20in%20time%20spent%20in%20other%20activities%20such%20as%20transportation%20and%20finances.%20Despite%20asking%20detailed%20questions%20about%20amenities%2C%20we%20find%20no%20impact%20on%20quality%20of%20employment%2C%20and%20our%20confidence%20intervals%20can%20rule%20out%20even%20small%20improvements.%20We%20observe%20no%20significant%20effects%20on%20investments%20in%20human%20capital%2C%20though%20younger%20participants%20may%20pursue%20more%20formal%20education.%20Overall%2C%20our%20results%
20suggest%20a%20moderate%20labor%20supply%20effect%20that%20does%20not%20appear%20offset%20by%20other%20productive%20activities.%22%0A%0A%20%5Bopenresearchlab.org%20https%3A%2F%2Fwww.openresearchlab.org%2Ffindings%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=d4f35dbc-79bd-4f78-875c-4206cba79966&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

      <item>
  <title>OpenAI &quot;On the Cusp&quot; of Level 2 AGI, Generalist Robotics Model, New TTT Architecture, and Better Tiny Local Models</title>
  <description>OpenAI, SkildAI, Meta, AGI, Test-Time Training</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/03c7a4b6-6a0b-45cf-a173-a8d18b2f4341/An_isometric_retrofu__2_.jpg" length="136208" type="image/jpeg"/>
  <link>https://edit.headline.com/p/ai-openai-cusp-level-2-agi-generalist-robotics-model-new-ttt-architecture-better-tiny-local-models-7</link>
  <guid isPermaLink="true">https://edit.headline.com/p/ai-openai-cusp-level-2-agi-generalist-robotics-model-new-ttt-architecture-better-tiny-local-models-7</guid>
  <pubDate>Wed, 17 Jul 2024 01:01:00 +0000</pubDate>
  <atom:published>2024-07-17T01:01:00Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/03c7a4b6-6a0b-45cf-a173-a8d18b2f4341/An_isometric_retrofu__2_.jpg?t=1724182114"/></div><p class="paragraph" style="text-align:left;">This week&#39;s AI developments make up for a slow week last week. We see notable commentary, research, and applications. First, OpenAI is allegedly making strides towards Level 2 AI, which they have labeled as &quot;Reasoners,&quot; and defined as performing human-level problem-solving tasks. Research on Test-Time Training (TTT) layers shows that models can adapt and improve in real-time, potentially outperforming traditional models in long-context tasks. Skild AI&#39;s recent funding underscores the buy-in and investment in generalist AI models to drive robotics forward, and finally, Meta&#39;s MobileLLM models demonstrate efficient and capable, on-device AI solutions that address limitations in mobile technology. Happy reading!</p><p class="paragraph" style="text-align:left;">--Sasha Krecinic</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="open-ai-on-the-cusp-of-level-2-agi"><a class="link" href="https://www.bloomberg.com/news/articles/2024-07-11/openai-sets-levels-to-track-progress-toward-superintelligent-ai?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-on-the-cusp-of-level-2-agi-generalist-robotics-model-new-ttt-architecture-and-better-tiny-local-models" target="_blank" rel="noopener noreferrer nofollow">OpenAI &quot;On the Cusp&quot; of Level 2 AGI</a></h3><p class="paragraph" style="text-align:left;">According to a recent Bloomberg article, OpenAI executives have said the company is “on the cusp” of Level 2 AGI in its five-tier AI progress tracking system. 
It is also rumored that OpenAI demonstrated GPT-4 with improved reasoning capabilities at a recent all-hands meeting. Level 2 involves AI systems performing problem-solving tasks at the level of a human with doctorate-level education, without using any tools. The full scale comprises the following stages of artificial intelligence: </p><p class="paragraph" style="text-align:left;">Level 1: Chatbots - AI with conversational language abilities </p><p class="paragraph" style="text-align:left;">Level 2: Reasoners - AI with human-level problem-solving capabilities </p><p class="paragraph" style="text-align:left;">Level 3: Agents - Systems that can take actions </p><p class="paragraph" style="text-align:left;">Level 4: Innovators - AI that can aid in invention </p><p class="paragraph" style="text-align:left;">Level 5: Organizations - AI that can perform the work of an entire organization </p><p class="paragraph" style="text-align:left;">[<a class="link" href="https://www.bloomberg.com/news/articles/2024-07-11/openai-sets-levels-to-track-progress-toward-superintelligent-ai?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-on-the-cusp-of-level-2-agi-generalist-robotics-model-new-ttt-architecture-and-better-tiny-local-models" target="_blank" rel="noopener noreferrer nofollow">OpenAI Scale Ranks Progress Toward ‘Human-Level’ Problem Solving</a>] Share this story by email</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="open-ai-researcher-comments-that-co"><a class="link" href="https://twitter.com/polynoamial/status/1810675986549428306?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-on-the-cusp-of-level-2-agi-generalist-robotics-model-new-ttt-architecture-and-better-tiny-local-models" target="_blank" rel="noopener noreferrer nofollow">OpenAI Researcher Comments That Company Focus is 
Still on Ambitious Research</a></h3><p class="paragraph" style="text-align:left;">OpenAI researcher Noam Brown, a specialist in AI reasoning, tweeted, &quot;When I joined @OpenAI a year ago, I feared ChatGPT&#39;s success might shift focus from long-term research to incremental product tweaks. But it quickly became clear that wasn&#39;t the case. @OpenAI excels at placing big bets on ambitious research directions driven by strong conviction. They remain committed to ambitious research despite the success of ChatGPT.&quot; This comment emphasizes that OpenAI continues to prioritize long-term research over incremental product tweaks. It also suggests that they are not solely measuring themselves against existing benchmarks but are focused on paradigm-shifting developments through research, such as Noam Brown&#39;s work. This stance contrasts with the messaging of the former alignment team, who recently departed OpenAI. [<a class="link" href="https://twitter.com/polynoamial/status/1810675986549428306?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-on-the-cusp-of-level-2-agi-generalist-robotics-model-new-ttt-architecture-and-better-tiny-local-models" target="_blank" rel="noopener noreferrer nofollow">via @polynoamial</a>] Share this story by email</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="ttt-layers-match-performance-of-tra"><a class="link" href="https://arxiv.org/abs/2407.04620?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-on-the-cusp-of-level-2-agi-generalist-robotics-model-new-ttt-architecture-and-better-tiny-local-models" target="_blank" rel="noopener noreferrer nofollow">TTT Layers Match Performance of Transformers and Mamba RNNs</a></h3><p class="paragraph" style="text-align:left;">According to a recent research paper, Test-Time Training (TTT) layers match or exceed the performance of strong Transformers and Mamba RNNs in long-context tasks. 
RNN stands for recurrent neural network, a type of artificial neural network designed to recognize patterns in sequences of data, such as text, genomes, handwriting, or numerical time series. TTT is a method in which a model continues to update its parameters during the testing phase using self-supervised learning, allowing it to adapt to new data in real time and improve performance dynamically. The paper highlights that TTT-Linear is faster than Transformers at 8k context and matches Mamba RNNs in wall-clock time. Wall-clock time refers to the actual elapsed time it takes to complete a task, as opposed to the number of operations or computational steps, and is crucial for evaluating the real-world efficiency of algorithms. Despite facing memory I/O challenges, TTT-MLP also shows significant potential. Because TTT layers use a machine learning model itself as the hidden state, updated through self-supervised learning even on test sequences, they are highly adaptive and efficient for long-context tasks and show promise for scalable applications. 
[<a class="link" href="https://arxiv.org/abs/2407.04620?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-on-the-cusp-of-level-2-agi-generalist-robotics-model-new-ttt-architecture-and-better-tiny-local-models" target="_blank" rel="noopener noreferrer nofollow">Learning to (Learn at Test Time): RNNs with Expressive Hidden States</a>] Share this story by email</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="skild-ai-raises-300-million-in-seri"><a class="link" href="https://www.forbes.com/sites/rashishrivastava/2024/07/09/this-15-billion-ai-company-is-building-a-general-purpose-brain-for-robots/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-on-the-cusp-of-level-2-agi-generalist-robotics-model-new-ttt-architecture-and-better-tiny-local-models" target="_blank" rel="noopener noreferrer nofollow">Skild AI Raises $300 Million in Series A Funding To Develop Universal Robots</a></h3><p class="paragraph" style="text-align:left;">Pittsburgh-based robotics startup Skild AI has raised $300 million at a $1.5 billion valuation in a Series A funding round led by Lightspeed Ventures, SoftBank, Coatue, and Jeff Bezos. Skild AI&#39;s models enable robots to perform tasks in unfamiliar environments, such as climbing stairs and recovering objects that slip out of hand. The robots demonstrated emergent capabilities, showcasing abilities they weren&#39;t explicitly taught. The AI model was trained on a database 1,000 times larger than those used by competitors, using diverse data collection techniques. According to the company&#39;s press release: &quot;Skild’s model serves as a shared, general-purpose brain for a diverse embodiment of robots, scenarios, and tasks, including manipulation, locomotion, and navigation. 
From resilient quadrupeds mastering adverse physical conditions to vision-based humanoids performing dexterous manipulation of objects for complex household and industrial tasks, the company’s model will enable the use of low-cost robots across a broad range of industries and applications.&quot; [<a class="link" href="https://www.forbes.com/sites/rashishrivastava/2024/07/09/this-15-billion-ai-company-is-building-a-general-purpose-brain-for-robots/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-on-the-cusp-of-level-2-agi-generalist-robotics-model-new-ttt-architecture-and-better-tiny-local-models" target="_blank" rel="noopener noreferrer nofollow">This $1.5 Billion AI Company Is Building A ‘General Purpose Brain’ For Robots</a>] Share this story by email</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="metas-mobile-llm-models-boost-on-de"><a class="link" href="https://arxiv.org/abs/2402.14905?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-on-the-cusp-of-level-2-agi-generalist-robotics-model-new-ttt-architecture-and-better-tiny-local-models" target="_blank" rel="noopener noreferrer nofollow">Meta&#39;s MobileLLM Models Boost On-Device Accuracy Without Increasing Size</a></h3><p class="paragraph" style="text-align:left;">Meta has introduced MobileLLM models designed for efficient on-device large language models (LLMs). With fewer than a billion parameters, these models perform competitively with larger models in specific tasks, showing significant improvements in chat benchmarks and API calling tasks. They use deep and thin architectures with embedding sharing and grouped-query attention mechanisms, enhancing accuracy without increasing model size. The practicality of these small models is highlighted for mobile devices, addressing concerns such as memory capacity and energy consumption, making them suitable for common on-device use cases. 
[<a class="link" href="https://arxiv.org/abs/2402.14905?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-on-the-cusp-of-level-2-agi-generalist-robotics-model-new-ttt-architecture-and-better-tiny-local-models" target="_blank" rel="noopener noreferrer nofollow">MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases</a>] Share this story by email</p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=ae0e432c-dd3f-4ec8-aa93-fc4ace2032b2&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

      <item>
  <title>OpenAI Goes Shopping; Anthropic Edges Into First Place; and Microsoft&#39;s Tiny Vision Model</title>
  <description></description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/372456c8-e7e1-422c-bc0d-fc0af5f92361/An_isometric_retrofu__3_.jpg" length="138026" type="image/jpeg"/>
  <link>https://edit.headline.com/p/ai-openai-goes-shopping-anthropic-edges-first-place-microsofts-tiny-vision-model-62624</link>
  <guid isPermaLink="true">https://edit.headline.com/p/ai-openai-goes-shopping-anthropic-edges-first-place-microsofts-tiny-vision-model-62624</guid>
  <pubDate>Thu, 27 Jun 2024 00:54:50 +0000</pubDate>
  <atom:published>2024-06-27T00:54:50Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/840ebc00-6a5c-4623-9768-44b9082566a8/An_isometric_retrofu__3_.jpg?t=1724182207"/></div><h1 class="heading" style="text-align:left;" id="ai-open-ai-goes-shopping-anthropic-"><b>…AI: OpenAI Goes Shopping; Anthropic Edges Into First Place; and Microsoft&#39;s Tiny Vision Model (6.26.24)</b></h1><p class="paragraph" style="text-align:left;"><span style="color:rgb(34, 34, 34);">This week brought substantial infrastructure and performance enhancements to the leading models. OpenAI’s acquisitions of Multi and Rockset are strategic moves to enhance its remote control and data retrieval capabilities, highlighting a broader industry shift towards more versatile and powerful agentic workflows. It makes you wonder not if, but when, they will release a search product. On the state-of-the-art (SOTA) front, we observed Anthropic’s enhancements with Claude 3.5 Sonnet and Microsoft’s debut of the Florence vision model, both of which represent significant advancements in AI model efficiency and effectiveness. Lastly, Ilya Sutskever, co-founder and former Chief Scientist at OpenAI, has news on his new project. Given his departure from OpenAI last month, this week’s acquisition headlines focused on &#39;Search&#39; and &#39;remote control&#39; may shed some light on why he left. 
Is it possible this direction did not align with the former Chief Scientist’s emphasis on creating safe AI?</span></p><p class="paragraph" style="text-align:left;">-- Sasha Krecinic</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="open-ai-acquires-rockset-to-boost-d"><a class="link" href="https://openai.com/index/openai-acquires-rockset/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-goes-shopping-anthropic-edges-into-first-place-and-microsoft-s-tiny-vision-model" target="_blank" rel="noopener noreferrer nofollow">OpenAI Acquires Rockset To Boost &quot;Data Infrastructure&quot; and &quot;Retrieval&quot; (aka Search)</a></h3><p class="paragraph" style="text-align:left;">OpenAI has acquired Rockset to enhance its data retrieval systems and plans to integrate Rockset&#39;s technology into its products to turn data into &quot;useful insights&quot;. If you read some of Rockset&#39;s documentation you will get a sense of what the team is working to solve. It would not be crazy to imagine a world where they power search with this. It could be the new generation of ad targeting and personalized recommendations. The capability to deliver nuanced, context-aware insights and recommendations opens new revenue streams, enhancing OpenAI&#39;s competitive and commercial edge online and within agentic workflows. 
[<a class="link" href="https://openai.com/index/openai-acquires-rockset/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-goes-shopping-anthropic-edges-into-first-place-and-microsoft-s-tiny-vision-model" target="_blank" rel="noopener noreferrer nofollow">OpenAI Acquires Rockset</a>] </p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="open-ai-acquires-multi-a-company-po"><a class="link" href="https://twitter.com/itsandrewgao/status/1805264567548748151?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-goes-shopping-anthropic-edges-into-first-place-and-microsoft-s-tiny-vision-model" target="_blank" rel="noopener noreferrer nofollow">OpenAI Acquires Multi, A Company Powering Remote Computer Control</a></h3><p class="paragraph" style="text-align:left;">OpenAI has acquired Multi, a startup specializing in remote computer control, and announced that Multi will stop its services. Multi has closed new team signups, and existing users can access the app until July 24, 2024, after which all user data will be deleted. 
[<a class="link" href="https://twitter.com/itsandrewgao/status/1805264567548748151?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-goes-shopping-anthropic-edges-into-first-place-and-microsoft-s-tiny-vision-model" target="_blank" rel="noopener noreferrer nofollow">via @itsandrewgao</a>] </p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="anthropics-claude-35-sonnet-climbs-"><a class="link" href="https://www.anthropic.com/news/claude-3-5-sonnet?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-goes-shopping-anthropic-edges-into-first-place-and-microsoft-s-tiny-vision-model" target="_blank" rel="noopener noreferrer nofollow">Anthropic&#39;s Claude 3.5 Sonnet Climbs Leaderboard</a></h3><p class="paragraph" style="text-align:left;">Anthropic says its Claude 3.5 Sonnet outperforms competitor models and Claude 3 Opus, operating at twice the speed and one-fifth the cost. Claude 3.5 Sonnet is their strongest vision model, surpassing Claude 3 Opus on standard vision benchmarks but still slightly behind OpenAI&#39;s offering. 
[<a class="link" href="https://www.anthropic.com/news/claude-3-5-sonnet?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-goes-shopping-anthropic-edges-into-first-place-and-microsoft-s-tiny-vision-model" target="_blank" rel="noopener noreferrer nofollow">Introducing Claude 3.5 Sonnet</a>] </p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="microsoft-launches-florence-2-visio"><a class="link" href="https://huggingface.co/collections/microsoft/florence-6669f44df0d87d9c3bfb76de?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-goes-shopping-anthropic-edges-into-first-place-and-microsoft-s-tiny-vision-model" target="_blank" rel="noopener noreferrer nofollow">Microsoft Launches Florence-2 Vision Model</a></h3><p class="paragraph" style="text-align:left;">Microsoft has launched Florence-2, a vision model released in 200M and 800M parameter versions, which they say matches the quality of models 100 times larger. Florence-2 is designed to handle a wide range of computer vision tasks, including captioning, object detection, and segmentation, using a unified, prompt-based representation. The model demonstrates strong zero-shot and fine-tuning performance, addressing challenges in spatial hierarchy and semantic granularity, and achieving state-of-the-art results on various vision benchmarks. 
[<a class="link" href="https://huggingface.co/collections/microsoft/florence-6669f44df0d87d9c3bfb76de?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-goes-shopping-anthropic-edges-into-first-place-and-microsoft-s-tiny-vision-model" target="_blank" rel="noopener noreferrer nofollow">Florence - a microsoft Collection</a>] </p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="ilya-sutskever-launches-safe-superi"><a class="link" href="http://ssi.inc/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-goes-shopping-anthropic-edges-into-first-place-and-microsoft-s-tiny-vision-model" target="_blank" rel="noopener noreferrer nofollow">Ilya Sutskever Launches Safe Superintelligence Inc.</a></h3><p class="paragraph" style="text-align:left;">Safe Superintelligence Inc. says it is focused on creating the first safe superintelligence by addressing safety and capabilities together. Sutskever announced that they will focus on &quot;one goal and one product&quot; and also mentioned that they are hiring. [<a class="link" href="http://ssi.inc/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-goes-shopping-anthropic-edges-into-first-place-and-microsoft-s-tiny-vision-model" target="_blank" rel="noopener noreferrer nofollow">Safe Superintelligence Inc.</a>] </p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=3febc58f-69a0-40cb-9a0e-92e04bfa30c8&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

      <item>
  <title>What If Our Assumptions On Compute Requirements Are Wrong? How A Tiny 8B Model Outperforms GPT-4 (6.21.24)</title>
  <description>Neuromorphic Computing, SakanaAI, AIResearch, HippoRAG</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/4d000625-63f8-4434-959b-5ee8540bba2c/An_isometric_retrofu__4_.jpg" length="124199" type="image/jpeg"/>
  <link>https://edit.headline.com/p/ai-assumptions-compute-requirements-wrong-tiny-8b-model-outperforms-gpt4-62124</link>
  <guid isPermaLink="true">https://edit.headline.com/p/ai-assumptions-compute-requirements-wrong-tiny-8b-model-outperforms-gpt4-62124</guid>
  <pubDate>Sat, 22 Jun 2024 02:55:00 +0000</pubDate>
  <atom:published>2024-06-22T02:55:00Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><p class="paragraph" style="text-align:left;"></p><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/4d000625-63f8-4434-959b-5ee8540bba2c/An_isometric_retrofu__4_.jpg?t=1724182305"/></div><p class="paragraph" style="text-align:left;">This week&#39;s edition focuses on the AI/Nature parallels, and one of the hottest fields of research: neuromorphic computing (the method of computer engineering in which elements of a computer are modeled after systems in the human brain and nervous system). </p><p class="paragraph" style="text-align:left;">While nature isn&#39;t a perfect model, it provides some upper and lower bounds for us to benchmark what is theoretically possible. The human brain runs on something like 25 watts of power for all of its cognitive functions. A single GPT-4 query, by contrast, is estimated to draw something like 300 watts. This chasm of consumption hints that nature has found a way to solve for &#39;compute&#39; on a budget. Algorithmic optimization and neuromorphic computing, using nature as inspiration and a research catalyst, have the potential to satisfy our ever-increasing hunger for compute and to enable dynamically trained models.</p><p class="paragraph" style="text-align:left;">This week, we have some interesting developments on this front, raising an important question: &quot;What if our assumptions about the computing requirements to train and run these models are wrong?&quot; Researchers from the Shanghai AI Lab published a paper that sheds some light, achieving results from a tiny 8B parameter model that outperforms GPT-4 in math and coding. We also see a RAG methodology called HippoRAG that emulates aspects of the hippocampus in the human brain. 
Additionally, we see an LLM training methodology from the team at SakanaAI, who use large language models for discovering and optimizing new training algorithms, akin to natural evolution.</p><p class="paragraph" style="text-align:left;"><span style="text-decoration:underline;"><b>A final thought:</b></span> Is this AI wave just hype? Have we “used up all the data for training”? Will we run out of compute? There is a lot of commentary like this saying that AI is “losing steam.” However, there are two different things that people often conflate. On the surface, this notion might appear to be true if you measure it by today&#39;s commercial applications (which often still leverage legacy technology). From a &quot;frontier&quot; research perspective (which I would posit is the rate-limiting step here), the developments we cover this month, in tandem with commentary from the founders I have spoken to, paint a very different picture. </p><p class="paragraph" style="text-align:left;">To paraphrase some sentiments from experienced founders, &quot;The frontier is moving so quickly that you have to choose what to build very carefully so you are not made obsolete by the rapid movements at the frontier.&quot; Sadly, the investment and hype at the wrapper/surface layer are overshadowing the large strides that continue to be made in the research space. Tying this back to this week’s title: this research is how an 8B parameter model outperformed GPT-4. 
</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="llama-3-model-with-mcts-outperforms"><a class="link" href="https://arxiv.org/abs/2406.07394?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=what-if-our-assumptions-on-compute-requirements-are-wrong-how-a-tiny-8b-model-outperforms-gpt-4-6-21-24" target="_blank" rel="noopener noreferrer nofollow">Llama-3 Model with MCTS Outperforms GPT-4 in Math Tasks</a></h3><p class="paragraph" style="text-align:left;">A small 8B Llama-3 model combined with Monte Carlo Tree Search (MCTS) reportedly outperforms GPT-4 in complex mathematical reasoning tasks. The approach systematically explores and refines candidate solutions using heuristic methods (practical rules of thumb and educated guesses that find satisfactory solutions quickly, even if they are not perfect or optimal). The algorithm builds a search tree by selecting, refining, and evaluating candidate answers, balancing its decisions with an enhanced Upper Confidence Bound (UCB) formula. Testing shows that MCTS significantly improves success rates on challenging math problems, advancing LLMs in complex tasks for more accurate and reliable AI-driven decision-making. 
[<a class="link" href="https://twitter.com?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=what-if-our-assumptions-on-compute-requirements-are-wrong-how-a-tiny-8b-model-outperforms-gpt-4-6-21-24" target="_blank" rel="noopener noreferrer nofollow">twitter.com</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="sakana-ai-shares-disco-pop-evolutio"><a class="link" href="https://sakana.ai/llm-squared/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=what-if-our-assumptions-on-compute-requirements-are-wrong-how-a-tiny-8b-model-outperforms-gpt-4-6-21-24" target="_blank" rel="noopener noreferrer nofollow">Sakana AI Shares DiscoPOP - Evolution for LLMs by LLMs</a></h3><p class="paragraph" style="text-align:left;">By leveraging LLMs to automatically create and test new optimization algorithms, Sakana AI developed Discovered Preference Optimization (DiscoPOP), a novel method combining logistic and exponential losses. This approach, which showed state-of-the-art performance, marks a step towards using AI to advance AI, reducing human intervention and computational resources. The research highlights the potential of LLM-driven discovery to continuously improve AI models, opening new avenues for innovation and efficiency in AI development. 
[<a class="link" href="https://sakana.ai/llm-squared/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=what-if-our-assumptions-on-compute-requirements-are-wrong-how-a-tiny-8b-model-outperforms-gpt-4-6-21-24" target="_blank" rel="noopener noreferrer nofollow">Sakana AI</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="hippo-rag-rag-that-mimics-human-mem"><a class="link" href="https://arxiv.org/abs/2405.14831?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=what-if-our-assumptions-on-compute-requirements-are-wrong-how-a-tiny-8b-model-outperforms-gpt-4-6-21-24" target="_blank" rel="noopener noreferrer nofollow">HippoRAG: RAG That Mimics Human Memory, Enhancing Speed and Efficiency</a></h3><p class="paragraph" style="text-align:left;">New research from Ohio State University introduces HippoRAG, a retrieval framework inspired by human memory, which they say outperforms existing methods by up to 20%, while being 10-30 times cheaper and 6-13 times faster. The study highlights HippoRAG&#39;s integration of LLMs, knowledge graphs, and the Personalized PageRank algorithm, demonstrating improvements in multi-hop question answering and single-step retrieval. This combination mimics the human brain&#39;s memory processes, similar to how the hippocampus and neocortex work together to store and integrate knowledge efficiently and effectively. 
[<a class="link" href="https://arxiv.org/abs/2405.14831?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=what-if-our-assumptions-on-compute-requirements-are-wrong-how-a-tiny-8b-model-outperforms-gpt-4-6-21-24" target="_blank" rel="noopener noreferrer nofollow">HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models </a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="greenblatt-achieves-50-accuracy-on-"><a class="link" href="https://redwoodresearch.substack.com/p/getting-50-sota-on-arc-agi-with-gpt?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=what-if-our-assumptions-on-compute-requirements-are-wrong-how-a-tiny-8b-model-outperforms-gpt-4-6-21-24" target="_blank" rel="noopener noreferrer nofollow">Greenblatt Achieves 50% Accuracy on ARC-AGI Test with GPT-4o</a></h3><p class="paragraph" style="text-align:left;">Ryan Greenblatt says he achieved 50% accuracy on the ARC-AGI public test set using GPT-4o, surpassing the previous state-of-the-art of 34%. He claims his solution reached 72% accuracy on a subset of the train set, compared to human performance of 85%, by using specialized few-shot prompts and better grid representations. 
[<a class="link" href="https://redwoodresearch.substack.com/p/getting-50-sota-on-arc-agi-with-gpt?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=what-if-our-assumptions-on-compute-requirements-are-wrong-how-a-tiny-8b-model-outperforms-gpt-4-6-21-24" target="_blank" rel="noopener noreferrer nofollow">Getting 50% (SoTA) on ARC-AGI with GPT-4o</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="deep-seek-coder-v-2-surpasses-gpt-4"><a class="link" href="https://github.com/deepseek-ai/DeepSeek-Coder-V2/blob/main/paper.pdf?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=what-if-our-assumptions-on-compute-requirements-are-wrong-how-a-tiny-8b-model-outperforms-gpt-4-6-21-24" target="_blank" rel="noopener noreferrer nofollow">DeepSeek-Coder-V2 surpasses GPT4-Turbo in Coding and Math</a></h3><p class="paragraph" style="text-align:left;">DeepSeek-Coder-V2, an open-source model, has reportedly outperformed GPT4-Turbo in coding and math, supporting 338 programming languages and extending context length to 128K. According to a paper posted on GitHub, it achieved 90.2% on HumanEval and 75.7% on MATH, surpassing GPT-4-Turbo-0409. [<a class="link" href="https://github.com/deepseek-ai/DeepSeek-Coder-V2/blob/main/paper.pdf?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=what-if-our-assumptions-on-compute-requirements-are-wrong-how-a-tiny-8b-model-outperforms-gpt-4-6-21-24" target="_blank" rel="noopener noreferrer nofollow">DeepSeek-Coder-V2/paper.pdf at main · deepseek-ai/DeepSeek-Coder-V2</a>] </p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=ed57e235-7f6c-436a-b34f-3f704d13dff8&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

      <item>
  <title>Apple Integrates ChatGPT, Unbabel&#39;s SOTA Translation LLM, and a Mixture of Agents Paper</title>
  <description>OpenAI, Unbabel, Apple, Hugging Face</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/edc9d74d-32b8-49fa-8a8a-75209c00fbce/An_isometric_retrofu__5_.jpg" length="125319" type="image/jpeg"/>
  <link>https://edit.headline.com/p/ai-apple-integrates-chatgpt-unbabels-sota-translation-llm-mixture-agents-paper-61124</link>
  <guid isPermaLink="true">https://edit.headline.com/p/ai-apple-integrates-chatgpt-unbabels-sota-translation-llm-mixture-agents-paper-61124</guid>
  <pubDate>Wed, 12 Jun 2024 00:31:08 +0000</pubDate>
  <atom:published>2024-06-12T00:31:08Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><h2 class="heading" style="text-align:start;" id="heading-2"></h2><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/f2f33f4d-0a03-450b-86e4-5303bdc67c09/An_isometric_retrofu__5_.jpg?t=1724184953"/></div><p class="paragraph" style="text-align:start;">This week we had WWDC, where Apple and OpenAI announced the integration of ChatGPT with Siri on Apple devices. This move didn&#39;t surprise many, as OpenAI and Apple&#39;s talks have been an open secret for months. We feature a great summary by Andrej Karpathy that captures what many are thinking about the announcement. We also see new state-of-the-art (SOTA) models in translation, an open-source robotics offering, and a research paper on MoA (Mixture of Agents) showing how the framework is pushing the frontier of AI capabilities yet again.</p><p class="paragraph" style="text-align:start;">— Sasha Krecinic</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="chat-gpt-integration-announced-for-"><a class="link" href="https://twitter.com/sama/status/1800237314360127905?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">ChatGPT Integration Announced for Apple Devices</a></h3><p class="paragraph" style="text-align:start;">Apple announced at WWDC 2024 that ChatGPT will be integrated into Siri and available for free in iOS 18 and macOS Sequoia later this year. This partnership with OpenAI aims to enhance Apple&#39;s AI features, making advanced AI accessible while maintaining a commitment to safety and innovation. Sam Altman also says he is excited about partnering with Apple to integrate ChatGPT into their devices later this year, which he believes users will greatly appreciate. 
The highlight of the conference was the new Siri demo. [<a class="link" href="https://twitter.com/sama/status/1800237314360127905?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">via @sama</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20ChatGPT%20Integration%20Announced%20for%20Apple%20Devices&body=%0A%0A%0AChatGPT%20Integration%20Announced%20for%20Apple%20Devices%0AFirst%20impacted%3A%20Apple%20device%20users%2C%20AI%20technology%20enthusiasts%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AApple%20announced%20at%20WWDC%202024%20that%20ChatGPT%20will%20be%20integrated%20into%20Siri%20and%20available%20for%20free%20in%20iOS%2018%20and%20macOS%20Sequoia%20later%20this%20year.%20This%20partnership%20with%20OpenAI%20aims%20to%20enhance%20Apple%27s%20AI%20features%2C%20making%20advanced%20AI%20accessible%20while%20maintaining%20a%20commitment%20to%20safety%20and%20innovation.%20Sam%20Altman%20also%20says%20he%20is%20excited%20about%20partnering%20with%20Apple%20to%20integrate%20ChatGPT%20into%20their%20devices%20later%20this%20year%2C%20which%20he%20believes%20users%20will%20greatly%20appreciate.%20This%20announcement%20was%20made%20via%20a%20post%20on%20X%2C%20where%20Altman%20expressed%20his%20enthusiasm%20for%20the%20upcoming%20integration.%0A%0A%20%5Bvia%20%40sama%20https%3A%2F%2Ftwitter.com%2Fsama%2Fstatus%2F1800237314360127905%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="andrejs-breakdown-of-apple-intellig"><a class="link" href="https://twitter.com/karpathy/status/1800242310116262150?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer 
nofollow">Andrej&#39;s Breakdown of ‘Apple Intelligence</a>’</h3><p class="paragraph" style="text-align:start;">Andrej Karpathy praised Apple&#39;s Intelligence announcement, highlighting the integration of AI across the entire OS. He outlined key themes: enabling multimodal I/O, seamless inter-operation of OS and apps, and a frictionless user experience. He also emphasized the potential for proactive AI features, on-device intelligence, and modular support for various function calling while maintaining privacy with on-device computing. Zooming out, we agree that this is possibly the first step in a fully autonomous and highly personalized AI strategy and capable agent model, where computer vision takes on-screen data as visual context and potentially moves in the direction of Microsoft&#39;s Copilot/Recall feature by &#39;seeing&#39; or recording on-screen activity. [<a class="link" href="https://twitter.com/karpathy/status/1800242310116262150?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">via @karpathy</a>] Share this story <a class="link" 
href="mailto:?subject=%20.%20.%20Andrej%27s%20Breakdown%20of%20Apple%27s%20ChatGPT%20Integration&body=%0A%0A%0AAndrej%27s%20Breakdown%20of%20Apple%27s%20ChatGPT%20Integration%0AFirst%20impacted%3A%20Apple%20device%20users%2C%20developers%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AAndrej%20Karpathy%20praised%20Apple%27s%20Intelligence%20announcement%2C%20highlighting%20the%20integration%20of%20AI%20across%20the%20entire%20OS.%20He%20outlined%20key%20themes%3A%20enabling%20multimodal%20I%2FO%2C%20seamless%20inter-operation%20of%20OS%20and%20apps%2C%20and%20a%20frictionless%20user%20experience.%20He%20also%20emphasized%20the%20potential%20for%20proactive%20AI%20features%2C%20on-device%20intelligence%2C%20and%20modular%20support%20for%20various%20function%20calling%20while%20maintaining%20privacy%20with%20on-device%20computing.%20Zooming%20out%2C%20we%20agree%20that%20this%20is%20possibly%20the%20first%20step%20in%20a%20fully%20autonomous%20and%20highly%20personalized%20AI%20strategy%20and%20capable%20agent%20model%2C%20where%20computer%20vision%20takes%20on-screen%20data%20as%20visual%20context%20and%20potentially%20moves%20in%20the%20direction%20of%20Microsoft%27s%20Copilot%2FRecall%20feature%20by%20%27seeing%27%20or%20recording%20on-screen%20activity.%0A%0A%20%5Bvia%20%40karpathy%20https%3A%2F%2Ftwitter.com%2Fkarpathy%2Fstatus%2F1800242310116262150%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="unbabel-launches-tower-llm-and-reac"><a class="link" href="https://unbabel.com/meet-towerllm/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">Unbabel Launches TowerLLM and Reaches SOTA Translation Performance</a></h3><p class="paragraph" style="text-align:start;">Unbabel says it has launched TowerLLM, a new translation LLM that 
outperforms competitors like GPT-4, GPT-3.5, Google, and DeepL in accuracy and cost-efficiency. The company highlights that TowerLLM, built on billions of words of high-quality translation data, offers features such as source correction and named entity recognition, and supports 18 language pairs across various domains. [<a class="link" href="https://unbabel.com/meet-towerllm/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">Introducing TowerLLM</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20Unbabel%20Launches%20TowerLLM%20and%20Reaches%20SOTA%20Translation%20Performance%20%20&body=%0A%0A%0AUnbabel%20Launches%20TowerLLM%20and%20Reaches%20SOTA%20Translation%20Performance%20%20%0AFirst%20impacted%3A%20multilingual%20content%20managers%2C%20software%20developers%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AUnbabel%20says%20it%20has%20launched%20TowerLLM%2C%20a%20new%20translation%20LLM%20that%20outperforms%20competitors%20like%20GPT-4%2C%20GPT-3.5%2C%20Google%2C%20and%20DeepL%20in%20accuracy%20and%20cost-efficiency.%20The%20company%20highlights%20that%20TowerLLM%2C%20built%20on%20billions%20of%20words%20of%20high-quality%20translation%20data%2C%20offers%20features%20such%20as%20source%20correction%2C%20and%20named%20entity%20recognition%2C%20and%20supports%2018%20language%20pairs%20across%20various%20domains.%0A%0A%20%5BIntroducing%20TowerLLM%2C%20Multilingual%20by%20design%0A%0AUnbabel%E2%80%99s%20Generative%20AI%20model%20is%20the%20best%20performing%20machine%20translation%20on%20the%20market%2C%20enabling%20our%20customers%20to%20scale%20globally%20with%20lower%20costs%20and%20higher%20accuracy.%20https%3A%
2F%2Funbabel.com%2Fmeet-towerllm%2F%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="hugging-face-expands-locally-hosted"><a class="link" href="https://twitter.com/julien_c/status/1800153076994801929?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">Hugging Face Expands Locally Hosted AI App Offerings</a></h3><p class="paragraph" style="text-align:start;">Hugging Face has launched a second batch of local Generative AI apps, now available on compatible model pages. The company welcomes new additions to its community, a sentiment echoed by retweets from its CEO, Clement Delangue. [<a class="link" href="https://twitter.com/julien_c/status/1800153076994801929?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">via @julien_c</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20Hugging%20Face%20Expands%20Locally%20Hosted%20AI%20App%20Offerings&body=%0A%0A%0AHugging%20Face%20Expands%20Locally%20Hosted%20AI%20App%20Offerings%0AFirst%20impacted%3A%20AI%20developers%2C%20tech-savvy%20consumers%20%2F%2F%20Time%20to%20impact%3A%20%0A%0AHugging%20Face%20has%20launched%20a%20second%20batch%20of%20local%20Generative%20AI%20apps%2C%20now%20available%20on%20compatible%20model%20pages.%20The%20company%20welcomes%20new%20additions%20to%20its%20community%2C%20a%20sentiment%20echoed%20by%20retweets%20from%20its%20CEO%2C%20Clement%20Delangue.%0A%0A%20%5Bvia%20%40julien_c%20https%3A%2F%2Ftwitter.com%2Fjulien_c%2Fstatus%2F1800153076994801929%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" 
style="text-align:start;" id="le-robot-launches-on-py-torch-to-de"><a class="link" href="https://x.com/RemiCadene/status/1799000991876178038?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">LeRobot Launches on PyTorch to Democratize Robotics</a></h3><p class="paragraph" style="text-align:start;">LeRobot, developed on PyTorch, has been launched on the Hugging Face community page to enhance accessibility in robotics using advanced AI tools and models. According to their press release, LeRobot provides pre-trained models, datasets, and simulation environments to facilitate learning complex tasks in robotics without the need for physical robot assembly. [<a class="link" href="https://x.com/RemiCadene/status/1799000991876178038?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">via @RemiCadene</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="mixtureof-agents-methodology-outper"><a class="link" href="https://arxiv.org/abs/2406.04692?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">Mixture-of-Agents Methodology Outperforms GPT-4 Omni</a></h3><p class="paragraph" style="text-align:start;">In a recent research paper, scientists introduced the Mixture-of-Agents methodology, which combines multiple LLMs to enhance language model performance, achieving a 65.1% score on AlpacaEval 2.0 and surpassing GPT-4 Omni&#39;s 57.5%. This method utilizes a layered architecture where each layer&#39;s LLM agents refine responses based on the previous layer&#39;s outputs, demonstrating improved performance using only open-source LLMs. [<a class="link" href="https://arxiv.org/abs/2406.04692?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=apple-integrates-chatgpt-unbabel-s-sota-translation-llm-and-a-mixture-of-agents-paper" target="_blank" rel="noopener noreferrer nofollow">Mixture-of-Agents Enhances Large Language Model Capabilities</a>]</p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=438c54b7-bfac-416b-8f6d-7fbb07d49c72&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
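The layered refinement the paper describes is easy to sketch. Below is a minimal, hypothetical Python illustration of a Mixture-of-Agents pipeline; the `run_moa`, `make_agent`, and toy string-returning "agents" are stand-ins of our own invention, not the paper's implementation, and a real system would replace them with LLM API calls:

```python
def run_moa(prompt, layers, aggregator):
    """Toy Mixture-of-Agents pipeline.

    layers: list of layers; each layer is a list of agent functions
    with signature (prompt, prior_responses) -> response string.
    aggregator: (prompt, final_responses) -> synthesized answer.
    """
    prior = []  # responses produced by the previous layer
    for layer in layers:
        # Every agent in this layer sees the full set of responses
        # from the previous layer and produces a refined response.
        prior = [agent(prompt, prior) for agent in layer]
    return aggregator(prompt, prior)

def make_agent(name):
    # Stand-in for an LLM call: records which agent ran and how many
    # prior responses it was shown.
    return lambda prompt, prior: f"{name}({prompt}|saw {len(prior)})"

layers = [
    [make_agent("a1"), make_agent("a2")],  # layer 1 sees no prior output
    [make_agent("b1"), make_agent("b2")],  # layer 2 refines layer 1's output
]
aggregator = lambda prompt, prior: " + ".join(prior)

print(run_moa("Q", layers, aggregator))  # b1(Q|saw 2) + b2(Q|saw 2)
```

The point of the structure is visible even in the toy: first-layer agents answer cold, while later layers always condition on the previous layer's outputs before the aggregator synthesizes a final response.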
  ]]></content:encoded>
</item>

      <item>
  <title>Why Humans Struggle to Estimate AI Progress, Antitrust Woes, OpenAI Hiring Robotics, and Datasets (6.6.24)</title>
  <description>AGI, Antitrust, AI Robotics, Investment, Datasets</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/db7ce8af-8c8f-4f52-95fc-da5d1bd5be67/An_isometric_retrofu__6_.jpg" length="159115" type="image/jpeg"/>
  <link>https://edit.headline.com/p/ai-humans-suck-estimating-ai-progress-antitrust-woes-openai-hiring-robotics-datasets-6624</link>
  <guid isPermaLink="true">https://edit.headline.com/p/ai-humans-suck-estimating-ai-progress-antitrust-woes-openai-hiring-robotics-datasets-6624</guid>
  <pubDate>Fri, 07 Jun 2024 00:42:14 +0000</pubDate>
  <atom:published>2024-06-07T00:42:14Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/afe25ea0-b216-4a83-babe-7090744a2db2/An_isometric_retrofu__6_.jpg?t=1724185061"/></div><p class="paragraph" style="text-align:left;">This week&#39;s edition focuses on the training and progression of AI towards AGI. We also see activity by regulators for antitrust investigations, some new roles in Robotics advertised by the OpenAI team, and a new dataset for model training with detailed commentary. Short, sweet, and enlightening. Enjoy!</p><p class="paragraph" style="text-align:left;">-- Sasha Krecinic<br></p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="why-do-we-struggle-to-estimate-ai-a"><a class="link" href="https://situational-awareness.ai/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=why-humans-struggle-to-estimate-ai-progress-antitrust-woes-openai-hiring-robotics-and-datasets-6-6-24" target="_blank" rel="noopener noreferrer nofollow">Why do we struggle to estimate AI Advancements?</a></h3><p class="paragraph" style="text-align:left;">Humans inherently struggle with understanding exponential relationships and growth. The wheat and chessboard problem illustrates this. The story goes that a king agrees to place one grain of wheat on the first square of a chessboard, doubling it on each subsequent square. By the 64th square, the total exceeds 18 quintillion grains, enough to bankrupt the kingdom. Similarly, during COVID-19, many couldn&#39;t grasp how quickly the virus could spread, leading to delayed responses and widespread impacts. We all had that friend who warned, “It’s coming, start preparing,” but most didn’t listen. </p><p class="paragraph" style="text-align:left;">It may not be obvious, but AI development is experiencing similar exponential growth and the information asymmetry is also increasing. 
To understand how it will evolve, you need to look at the sub-components, roadmaps, and rate-limiting steps in the hardware, software, data, people, and capital. There are those in the know who quietly acknowledge this exponential growth, aware of its potential, often keeping their ‘extreme views’ to themselves, once again analogous to early COVID-19. </p><p class="paragraph" style="text-align:left;">Leopold Aschenbrenner&#39;s extensive analysis, &quot;Situational Awareness,&quot; highlights the accelerating pace of AI capabilities across the various layers that drive innovation. From massive compute clusters to evolving AI models, the trajectory toward Artificial General Intelligence (AGI) is still a function of the sum of its parts. Leopold posits that AI advancements will continue to be exponential, driven by computing power, algorithmic efficiency, and new methodologies. Despite potential bottlenecks like data scarcity and unknown challenges, the path to AGI is becoming clearer. Improvements in hardware, software, data, skills, and capital are compounding and driving transformative impacts that are often hiding in plain sight. </p><p class="paragraph" style="text-align:left;">While mainstream views often downplay AI ‘end game’ scenarios, a small group of experts is busily preparing for and predicting an ‘imminent’ (read this as within the next 3-5 years) AGI breakthrough. As an investor in this space, you see a range of people&#39;s reactions to this; unfortunately, some are still a little too skeptical, in my opinion. Naivety on this scale hasn’t served societies well historically. The consistent revision of AGI forecasts also points to AGI being much closer than society thinks. Check out the full text if you&#39;d like to see the detailed breakdown; I think it&#39;s worth a read! 
[<a class="link" href="https://situational-awareness.ai/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=why-humans-struggle-to-estimate-ai-progress-antitrust-woes-openai-hiring-robotics-and-datasets-6-6-24" target="_blank" rel="noopener noreferrer nofollow">Situational awareness</a>]</p><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/596fab38-57f7-429b-9046-6f3ef665620e/Screenshot_2024-06-06_at_5.35.21_PM.png?t=1717720530"/></div><hr class="content_break"><h3 class="heading" style="text-align:left;" id="regulators-target-ai-giants-for-ant"><a class="link" href="https://www.nytimes.com/2024/06/05/technology/nvidia-microsoft-openai-antitrust-doj-ftc.html?unlocked_article_code=1.xk0.Ynfh.VR9THK54Llzm&smid=url-share&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=why-humans-struggle-to-estimate-ai-progress-antitrust-woes-openai-hiring-robotics-and-datasets-6-6-24" target="_blank" rel="noopener noreferrer nofollow">Regulators Target AI Giants for Antitrust Probes</a></h3><p class="paragraph" style="text-align:left;">Federal regulators have agreed to initiate antitrust investigations into Microsoft, OpenAI, and Nvidia to examine their dominant positions in the AI industry, according to the New York Times, referencing two individuals familiar with the confidential discussions. The Justice Department will investigate Nvidia, while the FTC will focus on OpenAI and Microsoft, reflecting a broader initiative to address potential monopolistic practices in the AI sector. [<a class="link" href="https://www.nytimes.com/2024/06/05/technology/nvidia-microsoft-openai-antitrust-doj-ftc.html?unlocked_article_code=1.xk0.Ynfh.VR9THK54Llzm&smid=url-share&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=why-humans-struggle-to-estimate-ai-progress-antitrust-woes-openai-hiring-robotics-and-datasets-6-6-24" target="_blank" rel="noopener noreferrer nofollow">U.S. 
Clears Way for Antitrust Inquiries of Nvidia, Microsoft and OpenAI</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="open-ai-recruits-for-robotics"><a class="link" href="https://openai.com/careers/research-engineer-robotics/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=why-humans-struggle-to-estimate-ai-progress-antitrust-woes-openai-hiring-robotics-and-datasets-6-6-24" target="_blank" rel="noopener noreferrer nofollow">OpenAI Recruits for Robotics</a></h3><p class="paragraph" style="text-align:left;">OpenAI is seeking a Research Engineer for its Robotics team in San Francisco to focus on training and fine-tuning large multimodal LLMs, as detailed in their job posting. The role involves collaborating with industry partners to enhance robotics applications and marks the company&#39;s first robotics hiring of this kind since 2020. [<a class="link" href="https://openai.com/careers/research-engineer-robotics/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=why-humans-struggle-to-estimate-ai-progress-antitrust-woes-openai-hiring-robotics-and-datasets-6-6-24" target="_blank" rel="noopener noreferrer nofollow">OpenAI recruits for robotics talent</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:left;" id="fine-web-edu-sets-new-dataset-stand"><a class="link" href="https://twitter.com/Thom_Wolf/status/1797178777820127724?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=why-humans-struggle-to-estimate-ai-progress-antitrust-woes-openai-hiring-robotics-and-datasets-6-6-24" target="_blank" rel="noopener noreferrer nofollow">FineWeb-Edu Sets New Dataset Standard</a></h3><p class="paragraph" style="text-align:left;">Guilherme Penedo from Hugging Face shared a report on the release of FineWeb and its educational 
subset, FineWeb-Edu, which incorporates 1.3 trillion tokens from a high-quality filtered Common Crawl dataset. According to the report, FineWeb-Edu outperforms all other publicly available web-scale datasets on benchmarks such as MMLU, ARC, and OpenBookQA, prompting a reassessment of the perceived quality of internet data. The post also drew praise from industry leaders like Andrej Karpathy and Thomas Wolf, who called it potentially the best 45 minutes of reading in the space for understanding how high-performing models work. Check out the full text if you&#39;re working in this area! [<a class="link" href="https://twitter.com/Thom_Wolf/status/1797178777820127724?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=why-humans-struggle-to-estimate-ai-progress-antitrust-woes-openai-hiring-robotics-and-datasets-6-6-24" target="_blank" rel="noopener noreferrer nofollow">via @Thom_Wolf</a>]</p><p class="paragraph" style="text-align:left;">—</p></div></div>
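The chessboard arithmetic from the opening story is worth checking directly; a short Python sketch makes the scale concrete:

```python
# Wheat-and-chessboard: one grain on square 1, doubling on each of
# the 64 squares, so square i holds 2^(i-1) grains.
total = sum(2 ** i for i in range(64))

# The running total over all 64 squares is 2^64 - 1: roughly
# 1.8 * 10^19 grains, the classic case of an exponential outrunning
# linear intuition.
assert total == 2 ** 64 - 1 == 18_446_744_073_709_551_615
print(f"total grains: {total:.3e}")  # total grains: 1.845e+19
```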
  ]]></content:encoded>
</item>

      <item>
  <title>Smaller Models Get Better and China Invests in Chip Independence</title>
  <description>China, Mistral, Llama, Andrej Karpathy, Sonic</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/f74e9f4e-af5a-46f2-bc2e-ad0723514548/An_isometric_retrofu__7_.jpg" length="174475" type="image/jpeg"/>
  <link>https://edit.headline.com/p/smaller-models-get-better-china-invests-chip-independence-52924</link>
  <guid isPermaLink="true">https://edit.headline.com/p/smaller-models-get-better-china-invests-chip-independence-52924</guid>
  <pubDate>Thu, 30 May 2024 06:14:32 +0000</pubDate>
  <atom:published>2024-05-30T06:14:32Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><p class="paragraph" style="text-align:left;"></p><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/f74e9f4e-af5a-46f2-bc2e-ad0723514548/An_isometric_retrofu__7_.jpg?t=1724185745"/></div><p class="paragraph" style="text-align:left;">This week&#39;s newsletter has a theme: the miniaturization of models while maintaining comparable results to larger models. This once again shows a potential path to locally hosted models. We have also intentionally left out all the drama/arguing happening on X between Elon and Yann, as it lacked substance. We also spotlight China&#39;s huge $47.5 billion USD investment in its semiconductor industry in an attempt to gain independence from US chip manufacturers and regulations.</p><p class="paragraph" style="text-align:left;">— Sasha</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="llama-3-v-claims-superior-performan"><a class="link" href="https://aksh-garg.medium.com/llama-3v-building-an-open-source-gpt-4v-competitor-in-under-500-7dd8f1f6c9ee?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">Llama3-V Claims Superior Performance at Reduced Size</a></h3><p class="paragraph" style="text-align:start;">The team from AmbientGPT has released Llama3-V, a new AI model developed on the Llama3 platform. It shows a 10-20% improvement in benchmarks over Llava, the previous state-of-the-art open-source model for multimodal understanding, while being 100 times smaller than the current SOTA models. The team has also released a Mac interface, which uses the context from the screen to help improve prompt responses and can run using local or cloud models. 
[Llama 3-V: Matching GPT4-V with a 100x smaller model and 500 dollars]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="andrej-karpathy-trains-gpt-2-in-90-"><a class="link" href="https://github.com/karpathy/llm.c/discussions/481?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">Andrej Karpathy Trains GPT-2 in 90 Minutes for $20 of Compute</a></h3><p class="paragraph" style="text-align:start;">According to a recent post, Andrej Karpathy has replicated the GPT-2 (124M) model in approximately 90 minutes of training at a cost of around $20. The training utilized the FineWeb dataset with 10 billion tokens and achieved a HellaSwag accuracy of 29.9, surpassing the original GPT-2 model&#39;s score of 29.4. The exercise highlights the replicability of GPT models and the limited costs of training small models. [<a class="link" href="https://github.com/karpathy/llm.c/discussions/481?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20 · karpathy llm.c · Discussion #481</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="codestral-launches-sets-new-ai-benc"><a class="link" href="https://mistral.ai/news/codestral/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">Codestral Launches, Sets New AI Benchmark For Model Size</a></h3><p class="paragraph" style="text-align:start;">Mistral AI introduces Codestral, a 22B open-weight generative AI model designed for code generation, fluent in over 80 programming languages, and equipped with a 32k context window. 
Codestral, now available under a Non-Production License and accessible via an API, outperforms similarly sized competitors on benchmarks and, based on user testimony, even challenges significantly larger models. [<a class="link" href="https://mistral.ai/news/codestral/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">Codestral: Hello, World!</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="cartesia-launches-sonic-for-real-ti"><a class="link" href="https://www.loom.com/share/72b8bd84009443d5926eb97f92d53a9f?sid=975fa7d0-bdc5-4670-b4f2-80c1eba1c4eb&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">Cartesia Launches Sonic for Real-Time Voice Applications</a></h3><p class="paragraph" style="text-align:start;">Cartesia has launched Sonic, a new voice model with a latency of 135ms, designed for real-time applications such as customer support and entertainment. According to Cartesia, Sonic has demonstrated superior performance in tests, achieving twice the accuracy in audio generation and delivering initial audio output 1.5 times faster than existing Transformer models, with a fourfold increase in processing speed. 
[<a class="link" href="https://www.loom.com/share/72b8bd84009443d5926eb97f92d53a9f?sid=975fa7d0-bdc5-4670-b4f2-80c1eba1c4eb&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">Sonic Demo</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="china-launches-largest-semiconducto"><a class="link" href="https://www.reuters.com/technology/china-sets-up-475-bln-state-fund-boost-semiconductor-industry-2024-05-27/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">China Launches Largest Semiconductor Fund For Self-Sufficiency</a></h3><p class="paragraph" style="text-align:start;">China has launched its largest state-backed investment fund to date, totaling 344B yuan ($47.5B USD), to bolster its semiconductor industry, according to official sources. 
The fund, established on May 24, includes significant contributions from China&#39;s finance ministry and major Chinese banks, and its launch drove a 3% increase in the CES CN Semiconductor Index. [<a class="link" href="https://www.reuters.com/technology/china-sets-up-475-bln-state-fund-boost-semiconductor-industry-2024-05-27/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">China sets up third fund with $47.5 bln to boost semiconductor sector</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="tougher-prompts-and-benchmarking-fo"><a class="link" href="https://lmsys.org/blog/2024-05-17-category-hard/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">Tougher Prompts and Benchmarking for the Chatbot Arena</a></h3><p class="paragraph" style="text-align:start;">Chatbot Arena has launched a new &quot;Hard Prompts&quot; category on its leaderboard, responding to community interest in more complex challenges for AI language models. The harder prompts also distinguish models with more robust capabilities and show that not all models&#39; performance degrades equally as prompts become harder. Overfitting and fine-tuning towards benchmark prompts are common issues, so the prompts in this instance were carefully curated to give a richer picture of model performance. 
[<a class="link" href="https://lmsys.org/blog/2024-05-17-category-hard/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=smaller-models-get-better-and-china-invests-in-chip-independence" target="_blank" rel="noopener noreferrer nofollow">Introducing Hard Prompts Category in Chatbot Arena | LMSYS Org</a>]</p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=c7a005ef-a8f4-40ee-a336-8547c77104c4&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

      <item>
  <title>A New Era for Personal Computing</title>
  <description>Agents, AI Privacy, Latency, OpenAI, Google, Microsoft</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/e33fffa4-d714-43d1-9ce1-1b7b3cd79b17/An_isometric_retrofu__8_.jpg" length="112620" type="image/jpeg"/>
  <link>https://edit.headline.com/p/new-era-personal-computing-52224</link>
  <guid isPermaLink="true">https://edit.headline.com/p/new-era-personal-computing-52224</guid>
  <pubDate>Thu, 23 May 2024 02:33:41 +0000</pubDate>
  <atom:published>2024-05-23T02:33:41Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/e33fffa4-d714-43d1-9ce1-1b7b3cd79b17/An_isometric_retrofu__8_.jpg?t=1724185983"/></div><p class="paragraph" style="text-align:left;">Microsoft has launched its AI-Optimized Copilot, along with a new line of PCs, marking a new chapter in personal and business computing. We break down why this is such a big deal and also share other players in the space working on agents and computer vision to improve contextual awareness. In case you missed it, other headlines from earlier in the week were about Google&#39;s I/O event and Gemini 1.5 results. Since then, we&#39;ve seen their search results incorporate some of the new functionality, which has been a step up in the search experience. We also see headlines in the world of &#39;agents,&#39; and we suspect this will continue to be the buzzword that garners attention for a while!</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="microsoft-launches-ai-optimized-cop"><a class="link" href="https://blogs.microsoft.com/blog/2024/05/20/introducing-copilot-pcs/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">Microsoft Launches AI-Optimized Copilot+ for PCs and Business Applications</a></h3><p class="paragraph" style="text-align:start;"><i>First impacted:</i> Everyone</p><p class="paragraph" style="text-align:start;">Microsoft has launched Copilot+ and a new line of Windows PCs designed specifically for AI applications. These devices feature powerful new silicon, including ARM processors with Neural Processing Units (NPUs) capable of a whopping 40+ TOPS, all-day battery life, and access to advanced AI models. 
Starting at $999, these PCs are within reach of average consumers and set an interesting stage for a world that can accommodate &#39;agent&#39;-centric workflows, or what Microsoft is calling their &#39;Copilot&#39;. </p><p class="paragraph" style="text-align:start;"><b>Hardware &gt; Local Models &gt; Privacy &gt; HAL9000? </b><br>The new Copilot+ PCs introduce AI-driven experiences like Recall and Cocreator. This innovation, along with advancements in multimodal AI and privacy, is transforming our interactions with computers. Users will be able to share significantly more data with a locally hosted model than they otherwise might feel comfortable with. Instead of the usual &quot;Don&#39;t insert any personal data,&quot; soon you might find yourself comfortable enough to share your social security or credit card details. The core mechanics of locally hosted models, combined with enhanced privacy and computer vision, could change most human-to-computer interactions. </p><p class="paragraph" style="text-align:start;"><b>Enhancements to Microsoft Copilot for Business </b><br>Microsoft has also announced significant updates to its Copilot feature, now integrating with Microsoft Teams and Planner to improve team collaboration by managing agendas and tracking action items. Additionally, the company introduced Copilot Studio, which allows users to create custom AI agents that automate business processes and integrate with business data systems. The ramifications for workflow management software are quite interesting, and it is impressive how quickly the landscape is changing. </p><p class="paragraph" style="text-align:start;"><b>Why Is This Important? </b><br>In the context of highly capable large language models (LLMs) with low latency, computer vision, and hardware optimized to host local LLMs/Agents, advancements are occurring faster than most anticipated (even those of us who were saying &quot;it&#39;s coming sooner than you think&quot;). 
We are quickly approaching a future where your local computer will be able to perform tasks autonomously and assist you proactively. What the local LLM cannot do on the local machine, the bigger, more powerful version in the cloud helps with, potentially guiding the local LLM as required. For instance, imagine needing to change your desktop background. Instead of searching for instructions online, you could simply ask, &quot;Hey, find how to change the desktop background to X,&quot; and your computer would handle it for you. Or if you are a software vendor, instead of having customer support, you could provide agent instructions that prompt users for help if they are clicking around for more than X seconds, offering guidance on the relevant workflow. This could potentially change most workflows significantly. [<a class="link" href="https://blogs.microsoft.com/blog/2024/05/20/introducing-copilot-pcs/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">Introducing Copilot+ PCs - The Official Microsoft Blog</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="internal-and-external-views-on-open"><a class="link" href="https://twitter.com/gdb/status/1791869138132218351?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">Internal and External Views on OpenAI&#39;s Alignment Problem</a></h3><p class="paragraph" style="text-align:start;"><i>First impacted:</i> Everyone</p><p class="paragraph" style="text-align:start;">In a significant shakeup at OpenAI, key team members Ilya Sutskever and Jan Leike resigned, with Leike highlighting that “safety culture and processes have taken a backseat to shiny products” at the company. Their departure follows the disbanding of OpenAI’s Superalignment team, which aimed to address long-term AI risks. Leike emphasized the need for serious preparations for AGI to benefit humanity. Meanwhile, AI leader Yann LeCun criticized the urgency around controlling superintelligent AI, arguing that we need to focus on developing systems smarter than basic animals first. He likened the current urgency to trying to ensure the safety of advanced aircraft before the fundamental technology even exists, emphasizing a gradual, iterative approach to AI development and safety. 
[<a class="link" href="https://twitter.com/gdb/status/1791869138132218351?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">via @gdb</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="wafer-shares-demo-of-agent-mobile-o"><a class="link" 
href="https://wafer.systems/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">Wafer Shares Demo of Agent + Mobile OS Capabilities</a></h3><p class="paragraph" style="text-align:start;"><i>First impacted:</i> mobile developers, smartphone users</p><p class="paragraph" style="text-align:start;">Developers at Wafer are creating AI at the OS level, enabling AI agents to use the same device interfaces as users, such as virtual keyboards and touchscreens, to enhance efficiency and user experience. They say this integration allows AI agents to access extensive app data and user interactions, potentially tripling efficiency by predicting and automating user actions without third-party app integrations. Check out the video; it&#39;s super interesting! [<a class="link" href="https://wafer.systems/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">Wafer</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="ambient-gpt-launches-mac-os-app-and"><a class="link" href="https://github.com/siddrrsh/ambientGPT?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">AmbientGPT Launches MacOS App and Enhances Contextual Understanding with Screen Vision Data</a></h3><p class="paragraph" style="text-align:start;"><i>First impacted:</i> MacOS developers, privacy-conscious MacBook users</p><p class="paragraph" style="text-align:start;">Awni Hannun says AmbientGPT, a new MacOS app that integrates GPT-4o for enhanced contextual understanding directly from your screen, is set to launch soon. The app, which operates entirely on-device to ensure data privacy, requires an ARM64 MacBook and a specific OpenAI API key, and is pending Apple certification. 
[<a class="link" href="https://github.com/siddrrsh/ambientGPT?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">GitHub - siddrrsh/ambientGPT</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="google-shares-gemini-15-pro-results"><a class="link" href="https://goo.gle/GeminiV1-5?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">Google Shares Gemini 1.5 Pro Results In Technical Report</a></h3><p class="paragraph" style="text-align:start;"><i>First impacted:</i> AI researchers, software developers</p><p class="paragraph" style="text-align:start;">A report from Google&#39;s Gemini team highlights that the Gemini 1.5 Pro model demonstrates improved performance over the previous 1.0 Ultra, particularly in text and vision benchmarks, achieving a 91.7% score on the MMLU benchmark. The model has enhanced in-context learning capabilities, especially in low-resource language translation and mixed-modal learning, as detailed in the updated 153-page technical report. Check it out if you&#39;d like to get into the details. [<a class="link" href="https://goo.gle/GeminiV1-5?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">goo.gle/GeminiV1-5</a>]</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="scale-ai-secures-1-billion-in-fundi"><a class="link" href="https://scale.com/careers?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">Scale AI Secures $1 Billion in Funding</a></h3><p class="paragraph" style="text-align:start;"><i>First impacted:</i> AI developers, 
investors, tech industry developers</p><p class="paragraph" style="text-align:start;">Scale AI, led by CEO Alexandr Wang, has raised $1 billion in a financing round with Accel and existing investors, boosting its valuation to $13.8 billion. The company says the funds will be used to further develop its frontier data and enhance its Data Engine, which supports advanced LLMs and computer vision models. [<a class="link" href="https://scale.com/careers?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=a-new-era-for-personal-computing" target="_blank" rel="noopener noreferrer nofollow">Careers | Scale AI</a>]</p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=1345cdfc-0249-4f0a-9833-ea7daa203fbb&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

      <item>
  <title>OpenAI Unveils GPT-4o: A Breakdown of This Year&#39;s Biggest Day in AI</title>
  <description>#GPT-4o Launch, Agents, Robots, Apple</description>
      <enclosure url="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/866f9e0a-ab95-4a5a-891e-8aeea94270fa/An_isometric_retrofu__9_.jpg" length="87106" type="image/jpeg"/>
  <link>https://edit.headline.com/p/openai-unveils-gpt4o-breakdown-years-biggest-day-ai-51324</link>
  <guid isPermaLink="true">https://edit.headline.com/p/openai-unveils-gpt4o-breakdown-years-biggest-day-ai-51324</guid>
  <pubDate>Tue, 14 May 2024 03:07:06 +0000</pubDate>
  <atom:published>2024-05-14T03:07:06Z</atom:published>
    <dc:creator>Sasha Krecinic</dc:creator>
  <content:encoded><![CDATA[
    <div class='beehiiv'><style>
  .bh__table, .bh__table_header, .bh__table_cell { border: 1px solid #c7bab0; }
  .bh__table_cell { padding: 5px; background-color: #FFFFFF; }
  .bh__table_cell p { color: #161618; font-family: 'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
  .bh__table_header { padding: 5px; background-color:#fcf0e8; }
  .bh__table_header p { color: #161618; font-family:'Roboto',-apple-system,BlinkMacSystemFont,Tahoma,sans-serif !important; overflow-wrap: break-word; }
</style><div class='beehiiv__body'><p class="paragraph" style="text-align:start;"></p><div class="image"><img alt="" class="image__image" style="" src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/866f9e0a-ab95-4a5a-891e-8aeea94270fa/An_isometric_retrofu__9_.jpg?t=1724186268"/></div><p class="paragraph" style="text-align:start;">Today&#39;s newsletter is a bit different. We have a diverse audience, and it&#39;s clear that the information gap between pioneers and the mainstream media is growing. Sometimes, we need to highlight what&#39;s just below the surface. OpenAI&#39;s release today was spectacular for many reasons, most of which weren&#39;t immediately apparent, mentioned by OpenAI, or covered by mainstream news. We&#39;ll dissect some of these and provide a brief breakdown of why they matter. Plus, there&#39;s more traction in the rumor mill about Apple&#39;s partnership with OpenAI and a new humanoid robot that costs less than a Toyota Yaris! </p><p class="paragraph" style="text-align:start;">Also, a quick administrative update: we&#39;ve decided to change the cadence of our newsletter from daily to weekly. This lets us deliver a hyper-curated roundup of the AI news that really matters and its implications, so you can stay informed without the clutter. If there&#39;s a breaking story you can&#39;t miss, we&#39;ll make an exception and fill you in immediately. 
Thanks for keeping up with us!</p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="open-ai-launches-gpt-4-o-and-its-mu"><a class="link" href="https://openai.com/index/hello-gpt-4o/?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-unveils-gpt-4o-a-breakdown-of-this-year-s-biggest-day-in-ai" target="_blank" rel="noopener noreferrer nofollow">OpenAI Launches GPT-4o And It&#39;s Much More Than Meets The Eye</a></h3><p class="paragraph" style="text-align:start;"><i>First impacted:</i> Everyone!<br><i>Time to impact:</i> <b>Short</b></p><p class="paragraph" style="text-align:start;">OpenAI has launched GPT-4o, now available in ChatGPT, which integrates text, audio, and vision processing in real time. We recommend checking out the demo video. OpenAI also highlighted that the new models had ranked as &#39;GPT2&#39; on the LMSYS chatbot leaderboards, which attracted a lot of attention and speculation. These models achieve roughly a +100 Elo rating over previous models (Elo being a rating system originally devised for chess players); since a rating gap of D points implies an expected win probability of 1 / (1 + 10^(-D/400)), a 100-point gap translates to about a 64% win rate against the next-closest models, with win rates reportedly reaching 78% in some comparisons. The model can respond to audio in as little as 232 milliseconds, operating at twice the speed and half the cost of GPT-4 Turbo. Here are some highlights we found particularly impressive: </p><p class="paragraph" style="text-align:start;"><b>Reasoning</b>: This is something that the AI community has been anticipating eagerly. The ability to break down a problem into its subcomponents and plan how to solve them is crucial. There is a significant increase in the model&#39;s &#39;logic&#39; capability, which has received little coverage to date. </p><p class="paragraph" style="text-align:start;"><b>Desktop App</b>: OpenAI has shown they are masters of tech and distribution. 
The ability to distribute their future products and functionality to desktops will be an amazing channel because they won’t be restricted to a web browser or mobile anymore. Having the ability to watch, learn, and interact with a user’s workstation will be pivotal in expanding their value to users. </p><p class="paragraph" style="text-align:start;"><b>Multimodality</b>: The ability to see and hear is a huge leap forward, and more important than it initially seems. If it can see the world, it will soon be able to see your computer screen. Once it can do that, it will have the potential to help you interact with your daily work. What does that mean? We are much closer to a world where AI can manage the flow of information and productive work than meets the eye. </p><p class="paragraph" style="text-align:start;"><b>On Mobile</b>: With ChatGPT in every pocket and a consumer-led distribution strategy, OpenAI has dominated the consumer market. The more users they get, the better the intent data and the better the model, creating a flywheel effect that will serve them well in the years ahead. </p><p class="paragraph" style="text-align:start;"><b>Latency</b>: The final limiting factor to pulling all this off was making it feel natural. To do all of the above in a reasonable turnaround time is an underappreciated engineering feat; the algorithmic optimization required is nothing short of a marvel. </p><p class="paragraph" style="text-align:start;">What does this potentially change? Most industries. It&#39;s a bigger change than GPT-4 was from GPT-3 and GPT-3 was from GPT-2. It&#39;s also confirmation that we are still on the exponential development ramp. The foundation is set to change most of our day-to-day routines, including search (yes, Google), communication (the 2013 movie <i>Her</i> example isn&#39;t lost on anyone), education, etc. We hope you are as excited about today&#39;s update as we are! 
[<a class="link" href="http://twitter.com?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-unveils-gpt-4o-a-breakdown-of-this-year-s-biggest-day-in-ai" target="_blank" rel="noopener noreferrer nofollow">twitter.com</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20OpenAI%20Launches%20GPT-4o%20And%20It%27s%20Much%20More%20Than%20Meets%20The%20Eye&body=%0A%0A%0AOpenAI%20Launches%20GPT-4o%20And%20It%27s%20Much%20More%20Than%20Meets%20The%20Eye%0AFirst%20impacted%3A%20developers%2C%20content%20creators%20%2F%2F%20Time%20to%20impact%3A%20Short%0A%0AOpenAI%20has%20launched%20GPT-4o%2C%20now%20available%20in%20ChatGPT%2C%20which%20integrates%20text%2C%20audio%2C%20and%20vision%20processing%20in%20real%20time.%20We%20recommend%20checking%20out%20the%20demo%20video.%20OpenAI%20also%20highlighted%20that%20the%20new%20models%20ranked%20as%20%27GPT2%27%20on%20the%20Lymsys%20chatbot%20leaderboards%2C%20which%20attracted%20a%20lot%20of%20attention%20and%20speculation.%20These%20models%20achieve%20a%20%2B100%20ELO%20rating%20over%20previous%20models%20(which%20is%20a%20scoring%20model%20originally%20used%20for%20Chess%20players)%2C%20representing%20a%2064%25%20win%20rate%20against%20the%20next%20closest%20models%20(which%20is%20a%20significant%20increase%20in%20win%20rate%20percentage%20of%2078%25).%20It%20accomplishes%20this%20in%20as%20little%20as%20232%20milliseconds%2C%20operating%20at%20twice%20the%20speed%20and%20half%20the%20cost%20of%20GPT-4%20Turbo.%20Here%20are%20some%20highlights%20we%20found%20particularly%20impressive%3A%0A%0AReasoning%3A%20This%20is%20something%20that%20the%20AI%20community%20has%20been%20anticipating%20eagerly.%20The%20ability%20to%20break%20down%20a%20problem%20into%20its%20subcomponents%20and%20plan%20on%20how%20to%20solve%20them%20is%20crucial.%20There%20is%20a%20significant%20increase%20in%20the%20model%27s%20%27logic%27%20capability%20which%20received%20little%20coverage%20to%20date.%0A%0ADesktop%20App
%3A%20OpenAI%20has%20shown%20they%20are%20masters%20of%20tech%20and%20distribution.%20The%20ability%20to%20distribute%20their%20future%20products%20and%20functionality%20to%20desktops%20will%20be%20an%20amazing%20channel%20because%20they%20won%E2%80%99t%20be%20restricted%20to%20a%20web%20browser%20or%20mobile%20anymore.%20Having%20the%20ability%20to%20watch%2C%20learn%2C%20and%20interact%20with%20a%20user%E2%80%99s%20workstation%20will%20be%20pivotal%20in%20expanding%20their%20value%20to%20users.%20%0A%0AMultimodality%3A%20The%20ability%20to%20see%20and%20hear%20is%20a%20huge%20leap%20forward%2C%20and%20more%20important%20than%20it%20initially%20seems.%20If%20it%20can%20see%20the%20world%2C%20it%20will%20soon%20be%20able%20to%20see%20your%20computer%20screen.%20Once%20it%20can%20do%20that%2C%20it%20will%20have%20the%20potential%20to%20help%20you%20interact%20with%20your%20daily%20work.%20What%20does%20that%20mean%3F%20We%20are%20much%20closer%20to%20a%20world%20where%20AI%20can%20manage%20the%20flow%20of%20information%20and%20productive%20work%20than%20meets%20the%20eye.%20%0A%0AOn%20Mobile%3A%20In%20every%20pocket%20and%20focusing%20on%20consumer-led%20distribution%20first%2C%20OpenAI%20has%20really%20dominated%20the%20consumer%20market%20in%20this%20respect.%20The%20more%20users%20they%20get%2C%20the%20better%20the%20intent%20data%20and%20the%20better%20the%20model%2C%20creating%20a%20flywheel%20effect%20that%20will%20serve%20them%20well%20in%20the%20years%20ahead.%0A%0ALatency%3A%20The%20final%20limiting%20factor%20to%20pulling%20all%20this%20off%20was%20it%20feeling%20natural.%20To%20do%20all%20of%20the%20above%20in%20a%20reasonable%20turnaround%20time%20is%20an%20engineering%20feat%20that%20will%20be%20underappreciated.%20The%20engineering%20and%20algorithmic%20optimization%20required%20to%20achieve%20this%20is%20nothing%20short%20of%20a%20marvel.%0A%0AWhat%20does%20this%20potentially%20change%3F%20Most%20industries.%20It%27s%20a%20bigger%20change%20than%20GPT-
4%20was%20from%20GPT-3%20and%20GPT-3%20was%20from%20GPT-2.%20It%27s%20also%20confirmation%20that%20we%20are%20still%20on%20the%20exponential%20development%20ramp.%20The%20foundation%20is%20set%20to%20change%20most%20of%20our%20day-to-day%20routines%2C%20including%20search%20(yes%2C%20Google)%2C%20communication%20(the%202013%20movie%20Her%20example%20isn%27t%20lost%20on%20anyone)%2C%20education%2C%20etc.%20We%20hope%20you%20are%20as%20excited%20about%20today%27s%20update%20as%20we%20are!%0A%0A%0A%20%5Btwitter.com%20https%3A%2F%2Fopenai.com%2Findex%2Fhello-gpt-4o%2F%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="unitree-launches-g-1-humanoid-agent"><a class="link" href="https://twitter.com/UnitreeRobotics/status/1789931753974517820?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-unveils-gpt-4o-a-breakdown-of-this-year-s-biggest-day-in-ai" target="_blank" rel="noopener noreferrer nofollow">Unitree Launches G1 Humanoid Agent That Costs Less Than a Toyota Yaris</a></h3><p class="paragraph" style="text-align:start;"><i>First impacted:</i> Everyone!<br><i>Time to impact:</i> <b>Short</b></p><p class="paragraph" style="text-align:start;">Unitree has launched the G1 Humanoid Agent, a $16,000 AI-driven robot with up to 34 joints, designed for agile, sports-like movement, according to the company. The robot also features AI-controlled dexterous hands capable of precise operations, potentially broadening its application to automating physical labor. 
[<a class="link" href="https://twitter.com/UnitreeRobotics/status/1789931753974517820?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-unveils-gpt-4o-a-breakdown-of-this-year-s-biggest-day-in-ai" target="_blank" rel="noopener noreferrer nofollow">via @UnitreeRobotics</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20Unitree%20Launches%20G1%20Humanoid%20Agent%20that%20Costs%20Less%20Than%20Toyota%20Yaris&body=%0A%0A%0AUnitree%20Launches%20G1%20Humanoid%20Agent%20that%20Costs%20Less%20Than%20Toyota%20Yaris%0AFirst%20impacted%3A%20sports%20coaches%2C%20physical%20laborers%20%2F%2F%20Time%20to%20impact%3A%20Short%0A%0AUnitree%20has%20launched%20the%20G1%20Humanoid%20Agent%2C%20a%20%2416%2C000%20AI-driven%20robot%20that%20boasts%20up%20to%2034%20joints%20and%20advanced%20joint%20movement%2C%20designed%20to%20enhance%20sports%20capabilities%2C%20according%20to%20the%20company.%20The%20robot%20features%20AI-controlled%20dexterous%20hands%20capable%20of%20precise%20operations%2C%20potentially%20broadening%20its%20application%20in%20automating%20physical%20labor.%0A%0A%20%5Bvia%20%40UnitreeRobotics%20https%3A%2F%2Ftwitter.com%2FUnitreeRobotics%2Fstatus%2F1789931753974517820%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p><hr class="content_break"><h3 class="heading" style="text-align:start;" id="rumors-of-apple-and-open-ai-partner"><a class="link" href="https://www.forbes.com/sites/kateoflahertyuk/2024/05/13/apples-new-chatgpt-deal-heres-what-it-means-for-your-iphone/?sh=43ee010583ef&utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-unveils-gpt-4o-a-breakdown-of-this-year-s-biggest-day-in-ai" target="_blank" rel="noopener noreferrer nofollow">Rumors of an Apple and OpenAI Partnership Continue to Grow</a></h3><p class="paragraph" style="text-align:start;"><i>First impacted:</i> iPhone users, privacy-conscious consumers<br><i>Time to impact:</i> <b>Short</b></p><p class="paragraph" 
style="text-align:start;">Rumors persist that Apple is nearing a partnership with OpenAI to integrate ChatGPT technology into its upcoming iOS 18, with a focus on user privacy by potentially processing data on-device. The collaboration could introduce a new chatbot feature for iPhones while maintaining Apple&#39;s commitment to privacy. [<a class="link" href="http://twitter.com?utm_source=edit.headline.com&utm_medium=newsletter&utm_campaign=openai-unveils-gpt-4o-a-breakdown-of-this-year-s-biggest-day-in-ai" target="_blank" rel="noopener noreferrer nofollow">twitter.com</a>] Share this story <a class="link" href="mailto:?subject=%20.%20.%20Rumors%20of%20Apple%20and%20OpenAI%20partnership%20Continue%20to%20Grow&body=%0A%0A%0ARumors%20of%20Apple%20and%20OpenAI%20partnership%20Continue%20to%20Grow%0AFirst%20impacted%3A%20iPhone%20users%2C%20privacy-conscious%20consumers%20%2F%2F%20Time%20to%20impact%3A%20Short%0A%0ARumors%20are%20enduring%20that%20Apple%20is%20nearing%20a%20partnership%20with%20OpenAI%20to%20integrate%20ChatGPT%20technology%20into%20its%20upcoming%20iOS%2018%2C%20focusing%20on%20user%20privacy%20by%20potentially%20processing%20data%20on-device.%20This%20collaboration%20could%20introduce%20a%20new%20chatbot%20feature%20for%20iPhones%20while%20maintaining%20Apple%27s%20commitment%20to%20privacy.%0A%0A%20%5Btwitter.com%20https%3A%2F%2Fwww.forbes.com%2Fsites%2Fkateoflahertyuk%2F2024%2F05%2F13%2Fapples-new-chatgpt-deal-heres-what-it-means-for-your-iphone%2F%3Fsh%3D43ee010583ef%5D" target="_blank" rel="noopener noreferrer nofollow">by email</a></p></div><div class='beehiiv__footer'><br class='beehiiv__footer__break'><hr class='beehiiv__footer__line'><a target="_blank" class="beehiiv__footer_link" style="text-align: center;" href="https://www.beehiiv.com/?utm_campaign=b701b196-85b4-4628-bece-cf21867a2844&utm_medium=post_rss&utm_source=headline_edit">Powered by beehiiv</a></div></div>
  ]]></content:encoded>
</item>

  </channel>
</rss>
