how deep does the rabbit hole go? try searching for "matrix"...
Show HN: CocoIndex – Open-Source Data framework for AI, built for data freshness
Hi HN, I’ve been working on CocoIndex, an open-source Data ETL framework to transform data for AI, optimized for data freshness.<p>You can start a CocoIndex project with `pip install cocoindex` and declare a data flow that can build ETL like LEGO - build a RAG pipeline for vector embeddings, knowledge graphs, or extract, transform data with LLMs. It is a data processing framework beyond text. When you run the data flow either with live mode or batch mode, it will process the data incrementally with minimal recomputation and make it super fast to update the target stores on source changes.<p>Get started video: <a href="https://www.youtube.com/watch?v=gv5R8nOXsWU" rel="nofollow">https://www.youtube.com/watch?v=gv5R8nOXsWU</a> Demo video: <a href="https://www.youtube.com/watch?v=ZnmyoHslBSc" rel="nofollow">https://www.youtube.com/watch?v=ZnmyoHslBSc</a><p>Previously, I’ve worked at Google on projects like search indexing and ETL infra for 8 years. After I left Google last year, I built various projects and went through pivoting hell. In all the projects I’ve built, data still sits in the center of the problem and I find myself focusing on building data infra other than the business logic I need for data transformation. The current prepackaged RAG-as-service doesn't serve my needs, because I need to choose a different strategy for the context, and I also need deduplication, clustering (items are related), and other custom features that are commonly needed. That’s where CocoIndex starts.<p>A simple philosophy behind it - data transformation is similar to formulas in spreadsheets. The ground of truth is at the source data, and all the steps to transform, and final target store are derived data, and should be reactive based on the source change. If you use CocoIndex, you only need to worry about defining transformations like formulas.<p>*Data flow paradigm* came in as an immediate choice - because there’s no side effect, lineage and observability just come out of the box.<p>*Incremental processing* - If you are a data expert, an analogy would be a materialized view beyond SQL. The framework tracks pipeline states in database, and only reprocessing necessary portions. When data has changed, framework handles the change data capture comprehensively and combines the mechanism for push and pull. Then clear stale derived data/versions and re-index data based on tracking data/logic changes or data TTL settings. There’s lots of edge cases to do it right, for example, when a row is referenced in other places, and the row changes. These should be handled at the level of the framework.<p>*At the compute engine level* - the framework should consider the multiple processes and concurrent updates. It should consider how to resume existing states from terminated execution. In the end, we want to build a framework that is easy to build with exceptional velocity, but scalable and robust in production.<p>*Standardized the interface throughout the data flow* - really easy to plugin custom logic like LEGO; with a variety of native built-in components. One example is that it takes a few lines to switch among Qdrant, Postgres, Neo4j.<p>CocoIndex is licensed under Apache 2.0 <a href="https://github.com/cocoindex-io/cocoindex">https://github.com/cocoindex-io/cocoindex</a><p>Getting started: <a href="https://cocoindex.io/docs/getting_started/quickstart" rel="nofollow">https://cocoindex.io/docs/getting_started/quickstart</a><p>Excited to learn your thoughts, and thank you so much! Linghua
Show HN: Morphik – Open-source RAG that understands PDF images, runs locally
Hey HN, we’re Adi and Arnav. A few months ago, we hit a wall trying to get LLMs to answer questions over research papers and instruction manuals. Everything worked fine, until the answer lived inside an image or diagram embedded in the PDF. Even GPT‑4o flubbed it (we recently tried O3 with the same, and surprisingly it flubbed it too). Naive RAG pipelines just pulled in some text chunks and ignored the rest.<p>We took an invention disclosure PDF (<a href="https://drive.google.com/file/d/1ySzQgbNZkC5dPLtE3pnnVL2rW_9aTeuG/view?usp=sharing" rel="nofollow">https://drive.google.com/file/d/1ySzQgbNZkC5dPLtE3pnnVL2rW_9...</a>) containing an IRR‑vs‑frequency graph and asked GPT “From the graph, at what frequency is the IRR maximized?”. We originally tried this on gpt-4o, but while writing this used the new natively multimodal model o4‑mini‑high. After a 30‑second thinking pause, it asked for clarifications, then churned out buggy code, pulled data from the wrong page, and still couldn’t answer the question. We wrote up the full story with screenshots here: <a href="https://docs.morphik.ai/blogs/gpt-vs-morphik-multimodal">https://docs.morphik.ai/blogs/gpt-vs-morphik-multimodal</a>.<p>We got frustrated enough to try fixing it ourselves.<p>We built Morphik to do multimodal retrieval over documents like PDFs, where images and diagrams matter as much as the text.<p>To do this, we use Colpali-style embeddings, which treat each document page as an image and generate multi-vector representations. These embeddings capture layout, typography, and visual context, allowing retrieval to get a whole table or schematic, not just nearby tokens. Along with vector search, this could now retrieve exact pages with relevant diagrams and pass them as images to the LLM to get relevant answers. It’s able to answer the question with an 8B llama 3.1 vision running locally!<p>Early pharma testers hit our system with queries like "Which EGFR inhibitors at 50 mg showed ≥ 30% tumor reduction?" We correctly returned the right tables and plots, but still hit a bottleneck, we weren’t able to join the dots across multiple reports. So we built a knowledge graph: we tag entities in both text and images, normalize synonyms (Erlotinib → EGFR inhibitor), infer relations (e.g. administered_at, yields_reduction), and stitch everything into a graph. Now a single query could traverse that graph across documents and surface a coherent, cross‑document answer along with the correct pages as images.<p>To illustrate that, and just for fun, we built a graph of 100 Paul Graham’s essays here: <a href="https://pggraph.streamlit.app/" rel="nofollow">https://pggraph.streamlit.app/</a> You can search for various nodes, (eg. startup, sam altman, paul graham and see corresponding connections). In our system, we create graphs and store the relevant text chunks along with the entities, so on querying, we can extract the relevant entity, do a search on the graph and pull in the text chunks of all connected nodes, improving cross document queries.<p>For longer or multi-turn queries, we added persistent KV caching, which stores intermediate key-value states from transformer attention layers. Instead of recomputing attention from scratch every time, we reuse prior layers, speeding up repeated queries and letting us handle much longer context windows.<p>We’re open‑source under the MIT Expat license: <a href="https://github.com/morphik-org/morphik-core">https://github.com/morphik-org/morphik-core</a><p>Would love to hear your RAG horror stories, what worked, what didn’t and any feedback on Morphik. We’re here for it.
Show HN: I open-sourced my AI toy company that runs on ESP32 and OpenAI realtime
Hi HN! Last year the project I launched here got a lot of good feedback on creating speech to speech AI on the ESP32. Recently I revamped the whole stack, iterated on that feedback and made our project fully open-source—all of the client, hardware, firmware code.<p>This Github repo turns an ESP32-S3 into a realtime AI speech companion using the OpenAI Realtime API, Arduino WebSockets, Deno Edge Functions, and a full-stack web interface. You can talk to your own custom AI character, and it responds instantly.<p>I couldn't find a resource that helped set up a reliable, secure websocket (WSS) AI speech to speech service. While there are several useful Text-To-Speech (TTS) and Speech-To-Text (STT) repos out there, I believe none gets Speech-To-Speech right. OpenAI launched an embedded-repo late last year which sets up WebRTC with ESP-IDF. However, it's not beginner friendly and doesn't have a server side component for business logic.<p>This repo is an attempt at solving the above pains and creating a great speech to speech experience on Arduino with Secure Websockets using Edge Servers (with Deno/Supabase Edge Functions) for fast global connectivity and low latency.
Show HN: Open Codex – OpenAI Codex CLI with open-source LLMs
Hey HN,<p>I’ve built Open Codex, a fully local, open-source alternative to OpenAI’s Codex CLI.<p>My initial plan was to fork their project and extend it. I even started doing that. But it turned out their code has several leaky abstractions, which made it hard to override core behavior cleanly. Shortly after, OpenAI introduced breaking changes. Maintaining my customizations on top became increasingly difficult.<p>So I rewrote the whole thing from scratch using Python. My version is designed to support local LLMs.<p>Right now, it only works with phi-4-mini (GGUF) via lmstudio-community/Phi-4-mini-instruct-GGUF, but I plan to support more models. Everything is structured to be extendable.<p>At the moment I only support single-shot mode, but I intend to add interactive (chat mode), function calling, and more.<p>You can install it using Homebrew:<p><pre><code> brew tap codingmoh/open-codex brew install open-codex </code></pre> It's also published on PyPI:<p><pre><code> pip install open-codex </code></pre> Source: <a href="https://github.com/codingmoh/open-codex">https://github.com/codingmoh/open-codex</a>
Local LLM inference – impressive but too hard to work with
Show HN: Light like the Terminal – Meet GTK LLM Chat Front End
Author here. I wanted to keep my conversation with #Gemini about code handy while discussing something creative with #ChatGPT and using #DeepSeek in another window. I think it's a waste to have Electron apps and so wanted to chat with LLMs on my own terms. When I discovered the llm CLI tool I really wanted to have convenient and pretty looking access to my conversations, and so I wrote gtk-llm-chat - a plugin for llm that provides an applet and a simple window to interact with LLM models.<p>Make sure you've configure llm first (<a href="https://llm.datasette.io/en/stable/" rel="nofollow">https://llm.datasette.io/en/stable/</a>)<p>I'd love to get feedback, PRs and who knows, perhaps a coffee! <a href="https://buymeacoffee.com/icarito" rel="nofollow">https://buymeacoffee.com/icarito</a>
AI assisted search-based research works now
Launch HN: Magic Patterns (YC W23) – AI Design and Prototyping for Product Teams
Alex and Teddy here. We’re launching Magic Patterns (<a href="https://www.magicpatterns.com">https://www.magicpatterns.com</a>), an AI prototyping tool that helps PMs and designers create functional, interactive designs and websites. There’s a demo video at <a href="https://www.youtube.com/watch?v=SK8C_tQBwIU" rel="nofollow">https://www.youtube.com/watch?v=SK8C_tQBwIU</a>, as well as video walkthroughs of specific examples at <a href="https://www.magicpatterns.com/docs/documentation/tutorials/video-tutorials">https://www.magicpatterns.com/docs/documentation/tutorials/v...</a><p>While other tools help with “AI-assisted coding,” we have been quietly focused on “AI-assisted designing.” With Magic Patterns you can visually communicate your idea, get hands on feedback from customers, and test new features.<p>Teddy and I are best friends and former frontend engineers turned founders. We arrived at Magic Patterns after several pivots—always in the design tooling space, but different products that all struggled to get usage. We started working on Magic Patterns after an internal hackathon. Teddy built a UI library catalog and I messed around with GPT 3.5. We thought it’d be fun to combine the two: an AI component generator. Describe whatever you want, and get back a React component!<p>That started to take off and we gained users, but it wasn’t developers using the tool. Instead, it was PMs, designers, and leadership who could finally communicate their ideas. They use it to test new ideas quickly, get feedback from customers, and improve communication with internal teams. Also, hobbyists (and programmers who aren’t designers) use us to create designs and UIs that they wouldn’t be able to otherwise.<p>We use Sonnet 3.5 and 3.7, and leverage a fine-tuned model for fast-applying edits. The most challenging part is determining the most relevant context to feed to the LLM. We attempt to solve this with our click to update feature and by letting users define a brand preset, or default prompt.<p>Unlike other tools in this space, we’re specifically focused on (1) product teams—we're realtime and collaborative; and (2) frontend only—we don't spin up a database or backend because we aren't solving "idea to fullstack app."<p>A common workflow is a product manager building an interactive prototype and then passing it off to a designer for more polish or directly to engineers. Many teams are even skipping Figma entirely now, telling us that it feels like an unnecessary middleman. Teams are instead generating clickable prototypes, collaborating directly with stakeholders, and using that as the mockup.<p>With Magic Patterns, you can: - Collaborate with your team on our infinite canvas; - Match your existing designs by creating reusable components directly; - Brainstorm features and flows. (The latter is what we use it for internally.)<p>We started as a way to build small, custom components, but now people are one-shotting entire websites and hosting them with us, or building dashboards that they share internally or in customer demos. People have sold $10k/mo contracts with Magic Patterns designs!<p>Small business owners—everyone from fishermen to driving instructors to hotel managers—are using us to build their websites and then hosting them with us. Example sites built by Magic Patterns include <a href="https://getdealflow.ai/" rel="nofollow">https://getdealflow.ai/</a> and <a href="https://joinringo.com/" rel="nofollow">https://joinringo.com/</a>. It’s amazing how people who couldn’t have done that before are now able to, and super gratifying to us to be empowering people in this way.<p>You can get started with our docs here: <a href="https://www.magicpatterns.com/docs/documentation/get-started/introduction">https://www.magicpatterns.com/docs/documentation/get-started...</a>, and you can try the actual product. Simply go to <a href="https://www.magicpatterns.com">https://www.magicpatterns.com</a> and prompt for any UI you want.<p>Today no login is required, just click “Coming from Hackernews?” and you’ll get 5 messages free to try. Once you hit the limit, you’ll then be prompted to login. Plans start at $19/mo for another 100 messages a month (<a href="https://www.magicpatterns.com/pricing">https://www.magicpatterns.com/pricing</a>).<p>We’re stoked to be sharing with HN today and are open to all feedback!
Pushing the Limits of LLM Quantization via the Linearity Theorem
Jagged AGI: o3, Gemini 2.5, and everything after
Gemma 3 QAT Models: Bringing AI to Consumer GPUs
Show HN: I built an AI that turns GitHub codebases into easy tutorials
Show HN: Web Video editor, 100% local, AI subtitle, auto cut based on volume
I made this mostly to make content creation faster.<p>When recording a youtube video/ podcast. there's alot of downtime (silence) while looking at scripts or getting the correct item to display. So I made a webapp to automatically cut and stich the video back together then I can further edit in davinci resolve.<p>But I recently found out that this is crazy good for rewatching lectures or meeting recordings, No need to render, just drop the video in and watch it right away.<p>30min lecture can be watched in 12min!<p>I even added auto generation of subtitles to make studying lecture /meeting better.<p>Best of all everything runs locally, even the AI is running local models.<p>Hope HN finds this useful like it did for me!
OpenAI's new reasoning AI models hallucinate more
A new ChatGPT version just dropped and GeoGuesser is now a solved problem
Recursive LLM prompts
Show HN: AgentAPI – HTTP API for Claude Code, Goose, Aider, and Codex
Microsoft researchers developed a hyper-efficient AI model that can run on CPUs
Show HN: Plandex v2 – open source AI coding agent for large projects and tasks
Hey HN! I’m Dane, the creator of Plandex (<a href="https://github.com/plandex-ai/plandex">https://github.com/plandex-ai/plandex</a>), an open source AI coding agent focused especially on tackling large tasks in real world software projects.<p>You can watch a 2 minute demo of Plandex in action here: <a href="https://www.youtube.com/watch?v=SFSu2vNmlLk" rel="nofollow">https://www.youtube.com/watch?v=SFSu2vNmlLk</a><p>And here’s more of a tutorial style demo showing how Plandex can automatically debug a browser application: <a href="https://www.youtube.com/watch?v=g-_76U_nK0Y" rel="nofollow">https://www.youtube.com/watch?v=g-_76U_nK0Y</a>.<p>I launched Plandex v1 here on HN a little less than a year ago (<a href="https://news.ycombinator.com/item?id=39918500">https://news.ycombinator.com/item?id=39918500</a>).<p>Now I’m launching a major update, Plandex v2, which is the result of 8 months of heads down work, and is in effect a whole new project/product.<p>In short, Plandex is now a top-tier coding agent with fully autonomous capabilities. It combines models from Anthropic, OpenAI, and Google to achieve better results, more reliable agent behavior, better cost efficiency, and better performance than is possible by using only a single provider’s models.<p>I believe it is now one of the best tools available for working on large tasks in real world codebases with AI. It has an effective context window of 2M tokens, and can index projects of 20M tokens and beyond using tree-sitter project maps (30+ languages are supported). It can effectively find relevant context in massive million-line projects like SQLite, Redis, and Git.<p>A bit more on some of Plandex’s key features:<p>- Plandex has a built-in diff review sandbox that helps you get the benefits of AI without leaving behind a mess in your project. By default, all changes accumulate in the sandbox until you approve them. The sandbox is version-controlled. You can rewind it to any previous point, and you can also create branches to try out alternative approaches.<p>- It offers a ‘full auto mode’ that can complete large tasks autonomously end-to-end, including high level planning, context loading, detailed planning, implementation, command execution (for dependencies, builds, tests, etc.), and debugging.<p>- The autonomy level is highly configurable. You can move up and down the ladder of autonomy depending on the task, your comfort level, and how you weigh cost optimization vs. effort and results.<p>- Models and model settings are also very configurable. There are built-in models and model packs for different use cases. You can also add custom models and model packs, and customize model settings like temperature or top-p. All model changes are version controlled, so you can use branches to try out the same task with different models. The newly released OpenAI models and the paid Gemini 2.5 Pro model will be integrated in the default model pack soon.<p>- It can be easily self-hosted, including a ‘local mode’ for a very fast local single-user setup with Docker.<p>- Cloud hosting is also available for added convenience with a couple of subscription tiers: an ‘Integrated Models’ mode that requires no other accounts or API keys and allows you to manage billing/budgeting/spending alerts and track usage centrally, and a ‘BYO API Key’ mode that allows you to use your own OpenAI/OpenRouter accounts.<p>I’d love to get more HNers in the Plandex Discord (<a href="https://discord.gg/plandex-ai" rel="nofollow">https://discord.gg/plandex-ai</a>). Please join and say hi!<p>And of course I’d love to hear your feedback, whether positive or negative. Thanks so much!
Translate Fortran to C++ with AI and RAG
OpenAI Codex CLI: Lightweight coding agent that runs in your terminal
Principles for Building One-Shot AI Agents
12-factor Agents: Patterns of reliable LLM applications
I've been building AI agents for a while. After trying every framework out there and talking to many founders building with AI, I've noticed something interesting: most "AI Agents" that make it to production aren't actually that agentic. The best ones are mostly just well-engineered software with LLMs sprinkled in at key points.<p>So I set out to document what I've learned about building production-grade AI systems: <a href="https://github.com/humanlayer/12-factor-agents">https://github.com/humanlayer/12-factor-agents</a>. It's a set of principles for building LLM-powered software that's reliable enough to put in the hands of production customers.<p>In the spirit of Heroku's 12 Factor Apps (<a href="https://12factor.net/" rel="nofollow">https://12factor.net/</a>), these principles focus on the engineering practices that make LLM applications more reliable, scalable, and maintainable. Even as models get exponentially more powerful, these core techniques will remain valuable.<p>I've seen many SaaS builders try to pivot towards AI by building greenfield new projects on agent frameworks, only to find that they couldn't get things past the 70-80% reliability bar with out-of-the-box tools. The ones that did succeed tended to take small, modular concepts from agent building, and incorporate them into their existing product, rather than starting from scratch.<p>The full guide goes into detail on each principle with examples and patterns to follow. I've seen these practices work well in production systems handling real user traffic.<p>I'm sharing this as a starting point—the field is moving quickly so these principles will evolve. I welcome your feedback and contributions to help figure out what "production grade" means for AI systems!
Why Cloudflare Is the Perfect Infrastructure for Building AI Applications
Benn Jordan's AI poison pill and the weird world of adversarial noise
MooseAgent: A LLM Based Multi-Agent Framework for Automating Moose Simulation
The path to open-sourcing the DeepSeek inference engine
A hackable AI assistant using a single SQLite table and a handful of cron jobs
DolphinGemma: How Google AI is helping decode dolphin communication
Implementing DeepSeek R1's GRPO algorithm from scratch
AMD NPU and Xilinx Versal AI Engines Signal Processing in Radio Astronomy (2024) [pdf]
AI used for skin cancer checks at London hospital
Google Is Winning on Every AI Front
GeoDeep's AI Detection on Maxar's Satellite Imagery
Show HN: Chonky – a neural approach for text semantic chunking
TLDR: I’ve made a transformer model and a wrapper library that segments text into meaningful semantic chunks.<p>The current text splitting approaches rely on heuristics (although one can use neural embedder to group semantically related sentences).<p>I propose a fully neural approach to semantic chunking.<p>I took the base distilbert model and trained it on a bookcorpus to split concatenated text paragraphs into original paragraphs. Basically it’s a token classification task. Model fine-tuning took day and a half on a 2x1080ti.<p>The library could be used as a text splitter module in a RAG system or for splitting transcripts for example.<p>The usage pattern that I see is the following: strip all the markup tags to produce pure text and feed this text into the model.<p>The problem is that although in theory this should improve overall RAG pipeline performance I didn’t manage to measure it properly. Other limitations: the model only supports English for now and the output text is downcased.<p>Please give it a try. I'll appreciate a feedback.<p>The Python library: <a href="https://github.com/mirth/chonky">https://github.com/mirth/chonky</a><p>The transformer model: <a href="https://huggingface.co/mirth/chonky_distilbert_base_uncased_1" rel="nofollow">https://huggingface.co/mirth/chonky_distilbert_base_uncased_...</a>
Hassabis Says Google DeepMind to Support Anthropic's MCP for Gemini and SDK
Show HN: AI quiz generator from any topic or book in seconds
Google will let companies run Gemini models in their own data centers
Show HN: Atari Missile Command Game Built Using AI Gemini 2.5 Pro
A modern HTML5 canvas remake of the classic Atari game from 1980. Defend your cities and missile bases from incoming enemy attacks using your missile launchers. Initially built using the Google Gemini 2.5 Pro AI LLM model.
Relyance AI’s new data tracking tool paves the way for greater AI accountability and explainability
The ambitious big data governance startup Relyance AI Inc. wants to tackle one of the biggest roadblocks preventing highly regulated enterprises from adopting artificial intelligence with the launch of a new platform called Data Journeys. The company said tod…
More accurate coding: Researchers adapt Sequential Monte Carlo for AI-generated code
Researchers from MIT, Yale, McGill University and others found that adapting the Sequential Monte Carlo algorithm can make AI-generated code better.
TD Bank to Open AI Center in New York City
TD Bank Group will open a new office in New York City for its artificial intelligence (AI) research and development center, Layer 6, later this year. With this new office, Layer 6, which currently operates from its head office in Toronto, will grow to more cl…
Companies Race Toward AI Agent Adoption
Companies are thinking hard about how to get the most out of new agentic AI designs.
Conversational AI startups hear sound of money as demand rises
Investor interest in voice technology is rising sharply. Venture capital funding surged to $202 million in 2024 across 12 deals, up from $7 million in 2023, per Venture Intelligence. This year, $19 million has gone into voice AI startups as of April 16.
Adobe Invests In Synthesia, A Startup Focused On AI Video Clone Technology
While Adobe has been infusing artificial intelligence, or AI, into its consumer-targeted apps, like Photoshop, for the past couple of years, the company may be planning to produce something larger with AI for its enterprise offerings by partnering with compan…
How AI-driven development tools impact software observability
“I made this whole program in 5 minutes with just a few (Insert GenAI tool) prompts. Any developer not using AI tools to replace developers will find themselves out of a job in two years” – random AI fanboy on X Let’s face it, the next few years are going to…
Artificial vision system design and implementation based on BaSrTiO3 & Nd2O3 composite memristors for efficient pattern recognition
J. Mater. Chem. C, 2025, Advance ArticleDOI: 10.1039/D4TC04138K, PaperHui He, Yifei Pei, Liu Chao, Xiaobing YanThe artificial vision system based on the principle of a biomimetic vision system is used for digital recognition of optical signals.To cite this ar…
Chemical Imaging for Biological Systems: Techniques, AI-Driven Processing, and Applications
J. Mater. Chem. B, 2025, Accepted ManuscriptDOI: 10.1039/D4TB02876G, Review ArticleYing Cui, Zhihan Zhang, Yuan Shi, Yongjie HuVisualizing the chemical compositions of biological samples is pivotal to advancing biological sciences, with the past two decades w…
Visible-light driven photoelectric synaptic transistors based on ZnO/SnO2 heterostructure for neuromorphic computing
J. Mater. Chem. C, 2025, Accepted ManuscriptDOI: 10.1039/D5TC00822K, PaperRanran Ci, Zezhong Yin, Guangtan Miao, YuHui Wang, Guoxia Liu, Fukai ShanPhotoelectric synaptic transistors (PSTs) exhibit tremendous potential in artificial intelligence systems, as th…
Bhavish Aggarwal curbs ambition, seeks to raise $300mn for AI venture Krutrim
Krutrim, unveiled on 15 December 2023, is Aggarwal’s moonshot AI project that has claimed it will build foundational models for Indic languages, as well as India’s own graphic processing unit (GPU) chip, to make its own data centres and AI ecosystem. Krutrim …
CW Innovation Awards: DBS AI assistant slashes call times and boosts productivity
The bank’s in-house generative AI tool is streamlining customer service, reducing manual work and speeding up response times, improving both employee and customer experience
CW Innovation Awards: Finding revenue gains from parked cars
Singapore real estate investment firm CapitaLand Investment taps machine learning models to optimise space utilisation across 23 commercial buildings and business parks
AI Startup Mechanize Aims for ‘Automation of All Work’
Will artificial intelligence (AI) eventually replace humans in the workplace? The founder of Mechanize seems to think so. The startup — whose launch was covered by a Saturday (April 19) TechCrunch report — debuted last week with that mission, drawing some co…
What's next for AI at DeepMind, Google's artificial intelligence lab | 60 Minutes
At Google DeepMind, researchers are chasing what’s called artificial general intelligence: a silicon intellect as versatile as a human's, but with superhuman speed and knowledge.
This AI-Powered Walking Aid Redefines Rehabilitation with Smart, Hands-Free Patient Recovery
This AI-Powered Walking Aid Redefines Rehabilitation with Smart, Hands-Free Patient RecoveryHealthcare systems around the world are under pressure, and South Korea is no exception. With a single healthcare professional often responsible for 16 to 43...
Agentic AI: How Agents, Workflows, and Orchestration Are Evolving Marketing Automation
The marketing automation industry is undergoing a profound transformation, driven by agentic AI—autonomous systems that can analyze data, make decisions, and execute tasks without human intervention. The global Agentic AI in Enterprise IT market is poised for…
Grok Adds Memory Feature To Deliver More Personalized And Context-Aware Responses
Grok, the artificial intelligence chatbot developed by xAI, is receiving a major update with the introduction of a memory feature. This new capability allows the AI model to retain and recall information from previous interactions with users, making its respo…
Kling 2.0 Review: State of the Art AI Video Quality
The latest version of the Kling AI video generator has plenty of improvements in motion, camera control, and coherence.
Microsoft's BitNet shows what AI can do with just 400MB and no GPU
The BitNet b1.58 2B4T model was developed by Microsoft's General Artificial Intelligence group and contains two billion parameters – internal values that enable the model to understand and generate language. To compensate for its low-precision weights, the mo…
Dubai Sees Launch of Litepips as Avenix Fzco Brings AI to Commodity Trading
DUBAI, UAE, April 19, 2025 (GLOBE NEWSWIRE) -- In response to growing demand for smart automation in commodity markets, Avenix Fzco has launched Litepips, a specialized AI tool for trading the XAU/USD pair. Gold trading continues to evolve, and artificial int…
Himex WiseEye Powered Smart Display Featuring Spectra 6 Launched
Himax’s WiseEye ultra-low-power AI sensing solution is equipped with Spectra 6. The post Himex WiseEye Powered Smart Display Featuring Spectra 6 Launched first appeared on Good e-Reader.
Prediction: These Recent Alphabet AI Innovations Could Be Big Drivers for the Company
In This Article: While investors have questioned the impact of artificial intelligence (AI) on Alphabet's (NASDAQ: GOOGL) (NASDAQ: GOOG) business, the company is quietly becoming an AI leader. This is evident from two recent AI announcements. Firebase Studio …
Is SoundHound AI a Buy?
In This Article: SoundHound AI (NASDAQ: SOUN) is at the forefront of the artificial intelligence (AI) market. The company's voice AI technology is used for everything from vehicle infotainment systems to drive-thru ordering kiosks at restaurants. The stock wa…
Understanding Basic Probability Is The First Step To Better Models
Whether you’re training an LLM to draft SEC filings or building a fraud model, the math is the same. Bayes’ Theorem helps us reverse-engineer cause from effect.
Machine Learning Engineer Interview Course 2025 - Interview Kickstart Releases ML Engineer Course With Interview Prep
Santa Clara, April 18, 2025 (GLOBE NEWSWIRE) -- Santa Clara, California - Rapid innovations in machine learning and artificial intelligence...
Security Startup Pillar Raises $9 Million to Tackle AI-Specific Risks
Artificial intelligence (AI) security startup Pillar Security has raised $9 million in seed funding to expand its research and development (R&D) and go-to-market efforts. Pillar Security’s solution is designed to meet the needs of a new age in which “software…
Google Updates Workspace With More Capabilities
Google is upgrading Workspace, its rival to Microsoft 365, by embedding more artificial intelligence (AI) and other functionality into the platform. Workspace includes Google’s Gmail, Docs, Sheets, Meet, Google Drive, Slides, Calendar and other tools many wor…
When you report bugs on iOS, some content may be used for AI training
If you decide to report a bug on a beta version of iOS, you now apparently have to let Apple use the uploaded content for Apple Intelligence training with no way to opt out.If you want to report a bug on iOS, content you upload may be used for AI training.On …
AI interpretability startup Goodfire raises $50M in funding
Goodfire AI Inc., a startup that helps developers understand how their large language models work, has raised $50 million in funding to support research initiatives. The company announced the Series A round on Thursday. Menlo Ventures led the investment with …
Google makes Gemini Advanced free for US college students
The company said Gemini Advanced, including Deep Research, Gemini Live, Canvas and video generation products, will be free for college students. Gemini will be enabled in Google Docs, Sheets and Slides, and Google's experimental image and video generation too…
Police in Maharashtra's Thane-Palghar introduce AI-based traffic violation detection system
The MBVV police have implemented an AI-based multi-violation detection system to improve traffic management and road safety across Thane and Palghar districts. Launched as a pilot in March, the system automates the identification of traffic offenses, minimizi…
Databricks buys feature engineering startup Fennel to enhance AI model development
Databricks Inc. said today it has swooped to acquire a young company called Fennel AI Inc. for an undisclosed price so it can enhance its data intelligence platform with real-time feature engineering capabilities. According to Databricks, Fennel has developed…
AI startups look for acqui-hires, mergers to accelerate journey
Some of the startups that have made acquisitions in the last two years include Krutrim and Sarvam, with many companies evaluating potential deals, ET has learnt.
1Fort Raises $7.5 Million for AI-Powered Platform for Business Insurance
InsurTech startup 1Fort has raised $7.5 million in a funding round to enhance the artificial intelligence (AI) capabilities of its platform for business insurance. The company’s platform is designed to empower insurance brokers by automating processes that ar…
AI evaluation platform LMArena is becoming a real startup
The founders of the popular generative artificial intelligence benchmarking platform LMArena have said they’re founding an official company called Arena Intelligence Inc. to help them improve the project in future. LMArena’s founders wrote in a blog post toda…
How DeepSeek, deep pockets, and data centers are giving Asia an AI edge
In January, as business and government leaders descended on Davos, Switzerland, for this year’s meeting of the World Economic Forum, the prevailing wisdom was that American tech giants were winning the race for dominance in artificial intelligence while China…
Language and logic: Mustafa’s journey into AI research
My journey with artificial intelligence (AI) began with my deep-rooted passion for both computer science and philosophy. I found myself drawn to natural language processing (NLP), where the study of formal logic, languages and semantics seamlessly connect to …
Applied Generative AI Course Launched by Interview Kickstart - 2025 Best GenAI Course With Agentic AI Projects For Top AI Jobs at Google Meta Netflix Microsoft OpenAI Nvidia
Santa Clara, April 17, 2025 (GLOBE NEWSWIRE) -- Santa Clara, California - Following the recent announcement that Chinese technology giant Baidu...
What Agentic AI Could Mean For Security Operations
Agentic AI is reshaping security operations by reducing alert fatigue and enabling faster threat response—ushering in a new era of autonomous, transparent SOCs.
Extending the Light Response Range of Organic Photoelectric Synaptic Transistors by p-Doping
J. Mater. Chem. C, 2025, Accepted ManuscriptDOI: 10.1039/D5TC00345H, PaperJie Huang, Shunhong Dong, Qingdong ZhengOrganic optoelectronic synaptic transistors show great promise as a platform for future artificial visual synapses and integrated computing-stora…
Head of Google Search discloses AI Overviews' secrets in interview
Elizabeth Reid, head of Google Search, has answered about the secrets behind AI Overviews and its impact on online search in an interview.
How AI and Automation Can Improve Worker Productivity
This article, How AI and Automation Can Improve Worker Productivity, was published on the Millo.co blog. Thanks for reading! Both Artificial Intelligence (AI) and automation are currently rapidly transforming the way we live and work. These new emerging techn…
SAS outlines blueprint for deploying autonomous AI agents at scale
Enterprises are racing to reinvent how decisions get made and who, or what, makes them. Artificial intelligence is entering a new phase of transformation with the rise of agentic AI, a wave of autonomous systems that can operate, learn and decide on their own…
Agentic AI sparks a shift in enterprise automation architecture
Enterprise automation is entering a new era as agentic artificial intelligence systems evolve from rigid workflows into adaptive, intelligence-driven architectures. Rather than relying on static rules and siloed tools, organizations are now leveraging AI agen…
Report: OpenAI in Talks to Acquire AI-Powered Coding Tool Windsurf
OpenAI is reportedly in discussions with Windsurf, an artificial intelligence (AI)-powered coding tool, to acquire the technology. The deal is worth $3 billion and would be OpenAI’s largest acquisition to date, according to a Wednesday (April 16) report from…
Google Bolsters Ad Safety Efforts With Enhanced AI Tools
Google said Wednesday (April 16) that it strengthened its use of artificial intelligence (AI) in the ad safety realm to protect consumers and its advertising ecosystem by making more than 50 enhancements to its large language models (LLMs) in 2024. “These upd…
Codex CLI Open Source AI Tool Converts Language to Code
OpenAI’s Codex CLI is a local, open-source AI coding agent that enables developers to write, edit, and manage code from the terminal, bridging AI models with real-world programming workflows.
OpenAI in talks to buy AI-coding tool Windsurf for about $3B: report
The deal would be OpenAI's largest to date, the terms of which have not yet been finalized, the report said, adding the talks could change or fall apart.
Artificial General Intelligence (AGI) Is Ready For Us. We’re Not Ready For It.
What we have now in AI, impressive as it is, is only the first short step in the direction. of AGI.
Report: Anthropic Set to Add Voice Capabilities to AI Assistant Claude
Anthropic is reportedly set to add voice capabilities to its artificial intelligence (AI) assistant, Claude. The new “voice mode” could be released this month and will first be available on a limited basis, Bloomberg reported Tuesday (April 15), citing an unn…
Google DeepMind pushes for faster ASIC design, energy efficiency to meet AI surge
As artificial intelligence accelerates across industries, the demand for high-performance computing fuels the rapid development of application-specific integrated circuits (ASICs). Jeff Dean, Chief Scientist at Google DeepMind and Google Research, recently un…
Pocket FM logs 68% revenue growth in FY25
Audio series platform Pocket FM reported Rs 1,768 crore in FY25 revenue, up 68%, driven by growth in India, the US, and Europe. With ARR nearing Rs 2,000 crore, the company leverages AI to cut costs and expand content. It now eyes Latin America expansion, des…
Pharmacy education curriculum to be revised; move after decade to keep pace with latest advances
The updated curriculum will focus on modern technologies like AI and robotics, effective from the next academic year.
GPT-4.1 Models Released for Azure AI Foundry and GitHub
Microsoft launches GPT-4.1, 4.1-mini, and 4.1-nano on Azure and GitHub, offering advanced coding, million-token context, and fine-tuning support—empowering developers to build smarter, scalable AI solutions with greater precision
A Deep Peek into DeepSeek’s Talent and Implications for US Innovation
Chinese startup DeepSeek has upended conventional wisdom about artificial intelligence (AI) innovation. Released in January 2025, the company’s R1 language model and V3 general-purpose large language model (LLM) sent tremors through markets and challenged ass…
Runway AI’s Gen-4: How Can AI Montage Go Beyond Absurdity
AI video start-up Runway AI, now valued at $3 billion, attracts a slew of collaborations from film studios. But can AI films create stories that stick?
AI at the core: IBM z17 reimagines the mainframe for modern enterprise
For decades, IBM Corp.’s IBM Z has powered some of the world’s most sensitive, high-volume transactions. But, with the recent launch of the IBM z17, the narrative is shifting. IBM Z’s role is no longer just about maintaining uptime and supporting critical sy…
Netflix tests new AI search engine to recommend shows, movies
<p>Netflix is testing new search technology for subscribers that employs artificial intelligence to help them find TV shows and movies, expanding its use of the technology.</p>
Kling AI Advances to the 2.0 Era, Empowering Everyone to Tell Great Stories with AI
HONG KONG, April 15, 2025 (GLOBE NEWSWIRE) -- Kuaishou Technology (“Kuaishou” or the “Company”; HKD Counter Stock Code: 01024 / RMB Counter Stock Code: 81024), a leading content community and social platform, announced that Kling AI hosted the “From Vision to…
When 17 Doctors Couldn't Help, AI Did: How ChatGPT Helped A Mother Diagnose Son's Rare Condition - News18
The AI chatbot not only helped the family identify the child's rare condition but also guided them towards the urgent treatment the child needed.
Humans will soon understand dolphins thanks to AI, claim experts
The launch of a new artificial intelligence model has brought humans closer to understanding dolphins, experts claim.
OpenAI Developing AI Agent to Replace Software Engineers, CFO Says
OpenAI CFO Sarah Friar said the startup is building an artificial intelligence (AI) agent that can do all the work of software engineers, not just augment their skills, during a wide-ranging interview at a recent Goldman Sachs conference. “This is not just au…
Google Cloud’s Ironwood TPU Forges Better Enterprise AI
Google Cloud's commitment in infrastructure such as TPUs reflects the scale of investment required to maintain competitive positioning in the AI infrastructure market.
Zinox boss endorses AI radio for customer engagement
The Chairman of Zinox Group and renowned tech entrepreneur, Dr. Leo Stan Ekeh, has thrown his weight behind Konga 103.7FM, Nigeria’s first AI-powered music and commerce radio station, describing it as a transformative platform set to redefine how brands build…
AI may aid screening for opioid use disorder
An artificial intelligence (AI) screening tool identified patients at risk for opioid use disorder and helped reduce readmission to hospitals.
Google Cloud’s Vertex And Models Advance Enterprise AI Agent Adoption
Google Cloud's updated its Vertex AI platform to support AI agents with including the Agent Development Kit, the Agent2Agent protocol, Agent Engine, and Agentspace
Tech firm arms executives to harness AI’s potential
Reliance Infosystems is making a bold play to weave artificial intelligence into Nigeria’s business fabric, launching a programme recently in Lagos to arm executives with the tools to harness AI’s potential. The Lagos-based tech outfit teamed up with Lagos Bu…
AI App Startups Making Rapid Gains in Sales and Funding
Startups that build artificial intelligence (AI) applications on top of large language models (LLMs) are reportedly making rapid gains in both sales and funding. These startups are reaching as much as $200 million in annual recurring revenue in less than two …
Unraveling the adsorption potential of Zr dithiol (MOF-DSH) through experimentation and neural network modeling
RSC Adv., 2025, 15,11811-11825DOI: 10.1039/D5RA00002E, Paper Open Access This article is licensed under a Creative Commons Attribution 3.0 Unported Licence.Nitin Gumber, Buddhadev Kanrar, Jaspreet Singh, Jitendra Bahadur, Rajesh V. PaiEfficient removal …
How one man created 6 million Wikipedia articles, and why he stopped
A single bot generated and published millions of articles on the largest non-English version of Wikipedia. The results caused a rift among editors — and a glimpse of what the online encyclopedia might face from AI.
Apple to Tap User Data for LLM Training
Apple is planning to analyze user data to improve its large language model (LLM) software while upholding user privacy. The company has been using synthetic data to train its artificial intelligence (AI) models but has found that method to be ineffective, Ap…
China narrows AI gap with US, but investment constraints loom
According to the latest AI Index Report released by Stanford University's Human-Centered Artificial Intelligence Institute, China is rapidly catching up with the US in developing cutting-edge artificial intelligence (AI) models.
Beyond The Llama Drama: 4 New Benchmarks For Large Language Models
To foster the development of LLMs that are statistically proficient and genuinely useful partners it is time to complement existing metrics with four new dimensions
Safe Superintelligence’s Value Jumps Sixfold After $6 Billion Funding Round
Artificial intelligence (AI) startup Safe Superintelligence (SSI) has reportedly raised $6 billion in new funding. The round, as reported Friday (April 11) by the Financial Times (FT), values the company at $32 billion, a more than sixfold increase from the l…
Bytedance Hyper Realistic AI Video Animator, Dreamina, Nears Microsoft VASA-1 Level
One year ago Microsoft Researchers created a hyper releastic AI video animator, VASA-1, but did not release it. Now, Bytedance owner of TikTok, is getting closer to matching the VASA-1 ability to hyper-realistically animate the facial muscles of faces and lip…
Defence-tech startup Optimized Electrotech raises $6 million
India-Business News: BENGALURU: Deeptech startup Optimized Electrotech has raised $6 million in Series A funding to accelerate the development of its artificial intelligen.
InnoEX 2025 Opens in Hong Kong with Next-Gen Drones, AI, and Robotics
InnoEX 2025 has officially opened at the Hong Kong Convention and Exhibition Centre, bringing together innovators, tech companies, and industry leaders from across the globe. Organized by the Hong Kong SAR Government’s Innovation, Technology and Industry Bure…
Dublin AI firm’s new virtual changing-room tech takes fitting accuracy ‘to a new level’
Dublin artificial intelligence firm Swan AI plans to transform fashion e-commerce with its new virtual fitting room – allowing shoppers “try on” clothes from any product link.
Google’s cloud play: integrated AI from infrastructure to apps
We predict Google’s overall cloud business will reach $54 billion in revenue this year. Our models indicate that Google Cloud Platform will contribute to more than half of that revenue for the first time ever. Despite this performance, Google faces criticism …
After Ghibli, Tom and Jerry receives an AI makeover and the internet can't keep calm, see reactions
Artificial intelligence is now creating animated shorts. A new AI tool made a 'Tom and Jerry' episode from a text prompt. Tom chases Jerry in an office setting in the short video. Social media users have mixed reactions. Some praise the tech, others prefer th…
Great shot
<p>Revellers enjoy the <strong>Songkran</strong> water play on Bangkok's <strong>Khao San</strong> Road where CCTV camera efficiency is enhanced this year with an artificial intelligence system that memorises faces and measures crowdedness. Varuth Hirunyatheb…
Netflix Debuts OpenAI-Backed Search Engine That Lets You Discover Movies And TV Shows Based On Emotions, Not Just Titles
Netflix Inc. (NASDAQ:NFLX) has begun testing an artificial intelligence–powered search engine that enables subscribers to find movies and TV shows based on their emotions, preferences, or specific situations. What Happened: The feature, powered by OpenAI tech…
Nvidia, Alphabet To Back OpenAI Co-Founder Ilya Sutskever's New AI Startup Safe Superintelligence: Report
Nvidia Corp. (NASDAQ:NVDA) and Alphabet Inc. (NASDAQ:GOOG) (NASDAQ:GOOGL) have reportedly decided to invest in the artificial intelligence startup co-founded by former OpenAI chief scientist Ilya Sutskever. What Happened: The specific terms of Nvidia's and Al…
Exclusive-Alphabet, Nvidia invest in OpenAI co-founder Sutskever's SSI, source says
SAN FRANCISCO : Alphabet and Nvidia have joined prominent venture capital investors to back Safe Superintelligence (SSI), a startup co-founded by OpenAI's former chief scientist Ilya Sutskever that has quickly risen to become one of the most valuable artifici…
Autonomize AI setting new standard for operational efficiency and clinical empowerment in healthcare through agentic AI
Artificial Intelligence (AI) is swiftly becoming indispensable in healthcare, especially as hospitals and health systems face escalating operational costs, workforce limitations, and intricate administrative burdens. Rather than replacing professionals, AI fu…
VSCode Copilot agent mode in Codespaces · GitHub Changelog
AI & MLLearn about artificial intelligence and machine learning across the GitHub ecosystem and the wider industry. Generative AILearn how to build with generative AI. GitHub CopilotChange how you work with GitHub Copilot. LLMsEverything developers need to kn…
Bridging AI innovations from predictive analytics to neuromorphic computing
Artificial intelligence continues to expand its reach across various industries, introducing new methods to interpret data, streamline processes, and create novel opportunities for growth. In retail, AI-driven recommendations are influencing how consumers bro…
TCS Calls Up Group Veterans to Lead the Voyage into AI
The IT giant has also created a new position of chief strategy officer appointing Mangesh Sathe, who currently is the chief executive of Tata Strategic Management Group. Sathe, who had previously worked as a principal with the Boston Consulting Group, will al…
Netflix is testing a new OpenAI-powered search
Netflix is starting to test search that’s powered by OpenAI, according to Bloomberg. The new search engine will let users “look for shows using far more specific terms, including the subscriber’s mood, for example, the company said,” per the report. This Open…
Enterprise AI demands spark surge in high-performance storage innovation
As artificial intelligence pushes deeper into enterprise operations, AI storage solutions are emerging as critical enablers of speed, scale and competitive edge. With data-hungry models growing more complex by the day, businesses are under mounting pressure t…
ChatGPT got over 52 times more monthly visitors than Copilot
OpenAI’s ChatGPT is reportedly more popular than other artificial intelligence (AI) tools by a huge margin. According to date from aitools.xyz, ChatGPT received over 52 times more visitors than Microsoft Copilot did in February 2025. According to the report, …
Content Creators Beware: AI Avatars Could Replace You!
By 2027, video will dominate internet traffic—but with AI avatars and digital twins, creators can finally overcome the visibility gap and produce high-quality videos at scale. The post Content Creators Beware: AI Avatars Could Replace You! appeared first on j…
ZenCoder Update to AI Coding Platform Tightens DevOps Integration
ZenCoder has updated its artificial intelligence (AI) platform for writing and testing code to provide integration with third-party DevOps tools such as JIRA, GitHub, GitLab and Sentry in addition to tighter coupling with VS Code and JetBrains integrated deve…
AAIT 0.0.4.73
Advanced Artificial Intelligence Tools is a package meant to develop and enable advanced AI functionalities in Orange
GenAI vendor Writer releases agentic platform for business
The platform is geared for business users, the vendor's primary targets. However, the vendor must compete with bigger tech companies’ agentic tools.
Bay Area Gen-Z entrepreneur uses AI to help job seekers
Michael Yan, co-founder and CEO of Simplify, is on a mission to transform the job search process using artificial intelligence. After experiencing firsthand the frustrations of job hunting — applying to over 100 positions per cycle, manually entering repetiti…
ChatGPT4.5 Crosses The Turing Test Threshold
Machines have never been able to pass as humans in Turing's famous test of intelligence. That has finally changed.
Why Amgen Hired An AI Head From Nike
Big Pharma and biotech startups alike are counting on AI to speed the process of drug discovery and make their operations more efficient. But most of the best data scientists aren’t working in healthcare—at least not yet.