AI inference company Groq has closed a $650 million funding round as it pivots its business following a landmark IP licensing agreement with Nvidia. The round was led by Disruptive, a Dallas-based late-stage investment firm whose founder Alex Davis also serves as Groq’s chairman, alongside Fort Lauderdale hedge fund Infinitum. The raise comes roughly six […]
Sam Altman has unveiled OpenAI’s first custom-built AI chip, Jalapeño, as the company moves to reduce its reliance on third-party hardware and strengthen control over the infrastructure powering its artificial intelligence products. According to OpenAI, the company has developed its…
Broadcom's custom chip for OpenAI could diversify the AI hardware market, challenging Nvidia's dominance and potentially lowering costs.
The post Broadcom unveils custom chip for OpenAI, challenging Nvidia’s dominance appeared first on Crypto Briefing.
Nvidia's current valuation could offer strategic investment opportunities, especially if AI-driven semiconductor growth meets future projections.
The post Nvidia trades cheaper than semiconductor sector, says Tony Zhang appeared first on Crypto Briefing.
Insider Brief PRESS RELEASE — Hydra Host has announced the closing of its Series A financing round of $100 million, led by Kindred Ventures, with participation from NVIDIA, ARK Invest, SPLY Capital, Jasper Lau’s Era Funds, Comcast Ventures, Magnetar, and PEAK6. Existing investors in the round include Founders Fund, 10x Founders, Sterling Road, and Flume Ventures (with […]
OpenAI has just revealed a new "intelligence processor" chip for AI servers made in partnership with Broadcom. The chip, called Jalapeño, is designed to power current and future large language models, according to an announcement on Wednesday.
Jalapeño is an ASIC (Application-Specific Integrated Circuit), meaning it's designed for a specific purpose: AI inference. With AI inference, models process a user's request to run an agent like Codex or offer a response from ChatGPT, while AI training involves a model consuming vast amounts of data to inform its responses.
It comes just nine months after OpenAI revealed that it would team up with Br …
Read the full story at The Verge.
Qualcomm's acquisition of Modular could diversify its revenue streams and challenge Nvidia's dominance in the data center AI market.
The post Qualcomm acquires Modular to enhance AI capabilities for data center push appeared first on Crypto Briefing.
UC San Diego's DFlash replaces autoregressive drafting with a lightweight block diffusion model for speculative decoding. It drafts whole token blocks in a single forward pass and conditions on target hidden features through KV injection. The paper reports up to 6.08x lossless speedup on Qwen3-8B, while NVIDIA reports up to 15x throughput on Blackwell at fixed interactivity. DFlash ships 20 checkpoints and supports SGLang, vLLM, and TensorRT-LLM.
The post DFlash Speculative Decoding Drafts Whole Token Blocks in Parallel for Up to 15x Higher Throughput on NVIDIA Blackwell appeared first on MarkTechPost.