TechFeed · Engineering and AI Blogs from Major Tech Companies

Yesterday · May 20 (Wed) · 7 posts

AWS ML · 9 hours ago

Announcing OpenAI-compatible API support for Amazon SageMaker AI endpoints

Today, Amazon SageMaker AI introduces OpenAI-compatible API support for real-time inference endpoints. If you use the OpenAI SDK, LangChain, or Strands Agents, you can now invoke models on SageMaker AI by changing only your endpoint URL. You don’t need a custom client, a SigV4 wr…

GitHub · 12 hours ago

Investigating unauthorized access to GitHub-owned repositories

If any impact is discovered, customers will be notified via established incident response and notification channels.

AWS ML · 15 hours ago

Multimodal evaluators: MLLM-as-a-judge for image-to-text tasks in Strands Evals

If you’re building visual shopping, image or document understanding, or chart analysis, you need a way to verify whether your model’s response is actually grounded in the source image. A text-only evaluator cannot tell you whether a caption faithfully describes an image, whether…

AWS ML · 16 hours ago

Build real-time voice applications with Amazon SageMaker AI and vLLM

Voice agents, live captioning, contact center analytics, and accessibility tools all depend on real-time speech-to-text, where your application streams audio in and receives transcription back simultaneously over a single persistent connection. Traditional request-response infere…

OpenAI · 1 day ago

An OpenAI model has disproved a central conjecture in discrete geometry

An OpenAI model solved the 80-year-old unit distance problem, disproving a major conjecture in discrete geometry and marking a milestone in AI-driven mathematics.

OpenAI · 1 day ago

How Ramp engineers accelerate code review with Codex

How Ramp engineers use Codex with GPT-5.5 to review code and ship improvements, allowing them to get substantive feedback in minutes instead of hours.

OpenAI · 1 day ago

The next phase of OpenAI’s Education for Countries

OpenAI advances Education for Countries, expanding AI adoption in schools with new partnerships, teacher training, and tools to improve global learning outcomes.

May 19 (Tue) · 2026-05-19 · 11 posts

OpenAI · 1 day ago

Introducing OpenAI for Singapore

OpenAI for Singapore launches a multi-year AI partnership to expand deployment, build local talent, and support businesses and public services with AI.

Hugging Face · 1 day ago

OlmoEarth v1.1: A more efficient family of Earth observation models

Google Research · 1 day ago

Empirical Research Assistance (ERA): From Nature publication to catalyzing Computational Discovery

General Science

Airbnb · 1 day ago

Scaling Airbnb’s identity graph with a unified knowledge graph infrastructure

How Airbnb shifts from PaaS to an internal knowledge graph infrastructure at scale. By Lucen Zhao, Shukun Yang, and Ashish Jain. Knowledge graphs offer a natural and powerful way to represent relationships between entities. Many real-world systems are fundamentally about connection…

AWS ML · 1 day ago

Scalable voice agent design with Amazon Nova Sonic: multi-agent, tools, and session segmentation

In this post, you’ll learn how to use Amazon Nova Sonic, Amazon Bedrock AgentCore, and Strands BidiAgent to build scalable, maintainable voice agents that handle these challenges efficiently, resulting in more responsive and intelligent customer interactions. We’ll explore three…

AWS ML · 1 day ago

Extending conversational memory in Kiro CLI using Amazon Bedrock AgentCore Memory

In this post, we demonstrate how you can extend the conversational memory of Kiro CLI by implementing a custom Model Context Protocol (MCP) server that integrates with Amazon Bedrock AgentCore Memory. You can use Kiro CLI to interact with AI agents of Kiro directly from your term…

AWS ML · 1 day ago

Accelerate ML feature pipelines with new capabilities in Amazon SageMaker Feature Store

Today, we’re announcing three new capabilities available in SageMaker Python SDK v3.8.0. In this post, we walk through each capability with code examples you can use to get started. For complete end-to-end walkthroughs, see the accompanying notebooks for Lake Formation governance…

AWS ML · 1 day ago

Implementing programmatic tool calling on Amazon Bedrock

In this post, we show three ways to implement Programmatic tool calling (PTC) on Amazon Bedrock: a self-hosted Docker sandbox on ECS for maximum control, a managed solution using Amazon Bedrock AgentCore Code Interpreter, and an Anthropic SDK-compatible path through a proxy for t…

Cloudflare · 1 day ago

Announcing Claude Managed Agents on Cloudflare

Cloudflare has integrated with Anthropic's Claude Managed Agents to provide a fast, isolated execution environment for autonomous code delivery. This means builders can scale agent workflows globally while strictly controlling access to private backends and easily customizing the…

OpenAI · 1 day ago

Advancing content provenance for a safer, more transparent AI ecosystem

OpenAI advances AI content provenance with Content Credentials, SynthID, and a verification tool to help people identify and trust AI-generated media.

Hugging Face · 2 days ago

Introducing the Ettin Reranker Family

May 18 (Mon) · 2026-05-18 · 14 posts

Netflix · 2 days ago

The Evolution of Cassandra Data Movement at Netflix

By Guil Pires, Jennifer Prince, Jose Camacho, Ken Kurzweil, and Phanindra Chunduru. Background: In a previous post, we introduced Data Bridge, a unified management plane for batch Data Movement at Netflix. Historically, several bespoke Data Movement connectors were developed acros…

AWS · 2 days ago

AWS Weekly Roundup: AWS Transform at 1 year, Claude Platform on AWS, EC2 M3 Ultra Mac instances, and more (May 18, 2026)

Just a year ago, we launched AWS Transform for .NET, Mainframe and VMware workloads, the first agentic AI service purpose-built for modernizing enterprise applications at scale. At re:Invent 2025, we introduced AWS Transform custom, which enables organizations to modernize and tr…

AWS ML · 2 days ago

Prompting Amazon Nova 2 for content moderation

In this post, you learn how to prompt Amazon Nova 2 Lite for content moderation using structured and free-form approaches, grounded in the MLCommons AILuminate Assessment Standard. The prompting techniques use the AILuminate taxonomy as an example, but they work equally well with…

DeepMind · 2 days ago

Fast-tracking genetic leads to reverse cellular aging

Biologists use Co-Scientist to find novel factors that successfully rejuvenate human cells.

AWS ML · 2 days ago

Aderant transforms cloud operations with Amazon Quick

In this post, we share how Aderant used the AI-powered capabilities of Amazon Quick to unify search across six vendor systems and automate documentation workflows, achieving 90 percent faster search times and 75 percent documentation acceleration, and how others can apply these a…

GitHub · 2 days ago

Take your local GitHub sessions anywhere

Kick off work in VS Code or the CLI, finish it from your phone. Remote control for GitHub Copilot sessions is now generally available on github.com and GitHub Mobile.

Hugging Face · 2 days ago

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

AWS ML · 2 days ago

Integrate Atlassian Confluence Cloud with Amazon Quick

In this post, you will learn how to set up the Confluence Cloud integration with Quick. This includes creating a knowledge base for semantic search, setting up Actions to query and manage Confluence pages, and organizing resources in Quick Spaces. Quick integrates with your curre…

Hugging Face · 2 days ago

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

AWS ML · 2 days ago

Build custom code-based evaluators in Amazon Bedrock AgentCore

In this post, you will implement four Lambda-based custom code evaluators for a financial market-intelligence agent, register each with AgentCore, and run them in on-demand and online modes. You will also see how to combine custom code-based evaluators with built-in evaluators an…

Hugging Face · 2 days ago

The Open Agent Leaderboard

Spotify · 2 days ago

Better Experiments with LLM Evals — A funnel, not a fork

TL;DR: LLM evals, automated judges that assess relevance, coherence, and quality at scale, are a powerful new...

OpenAI · 2 days ago

OpenAI and Dell partner to bring Codex to hybrid and on-premise enterprise environments

OpenAI and Dell partner to bring Codex to hybrid and on-premise environments, helping enterprises deploy AI coding agents securely across data and workflows.

Cloudflare · 3 days ago

Project Glasswing: what Mythos showed us

In recent weeks, we pointed Mythos and other security-focused LLMs at live code across critical parts of our infrastructure. We share what we observed, the models’ strengths and weaknesses, and what the work around them needs to look like before any of it can scale.

May 17 (Sun) · 2026-05-17 · 5 posts

DeepMind · 3 days ago

Simulate real-world places with Project Genie and Street View

We’re expanding access to Google AI Ultra subscribers globally and introducing a new capability powered by Street View.

DeepMind · 3 days ago

Introducing Gemini Omni

DeepMind · 3 days ago

Introducing Google Antigravity 2.0

DeepMind · 3 days ago

Gemini for Science: AI experiments and tools for a new era of discovery

A collection of science tools and experiments to expand the scale and precision of scientific exploration.

DeepMind · 3 days ago

Making it easier to understand how content was created and edited

We're expanding our tools to help you understand how content was created and edited across the web.

May 16 (Sat) · 2026-05-16 · 8 posts

DeepMind · 4 days ago

Strengthening Singapore’s AI Future: A New National Partnership

Google DeepMind and Singapore partner to apply frontier AI to complex challenges across health, education, sustainability, and more.

DeepMind · 5 days ago

Finding the molecular switches behind new infectious diseases

Clare Bryant uses Co-Scientist to identify genetic triggers in emerging infectious diseases.

DeepMind · 5 days ago

Opening new paths in aging research

Calico Life Sciences uses Co-Scientist to connect scattered findings and generate new leads in aging research.

DeepMind · 5 days ago

Accelerating discovery of liver disease mechanisms

Filippo Menolascina uses Co-Scientist to identify new liver disease treatments and explain why existing drugs only help certain patients.

DeepMind · 5 days ago

Uniting biological toolkits for a new approach to ALS

Co-Scientist unites Boston Children’s Hospital and MIT’s labs to explore new RNA-based treatments for ALS.

DeepMind · 5 days ago

Uncovering repurposed medicines to fight liver fibrosis

Stanford geneticist uses Co-Scientist to help find new treatments for chronic liver disease and liver fibrosis.

DeepMind · 5 days ago

How WeatherNext helped the National Hurricane Center better predict Hurricane Melissa’s historic landfall in Jamaica

Learn how our WeatherNext AI model helped forecasters give communities unprecedented time to prepare ahead of the historic Hurricane Melissa.

OpenAI · 5 days ago

OpenAI and Malta partner to bring ChatGPT Plus to all citizens

OpenAI and Malta partner to expand AI access, offering ChatGPT Plus and training to help citizens build practical AI skills and use AI responsibly.

May 15 (Fri) · 2026-05-15 · 12 posts

DeepMind · 5 days ago

Gemini 3.5: frontier intelligence with action

Gemini 3.5 is built to help you execute complex, agentic workflows.

Microsoft Research · 5 days ago

Further Notes on Our Recent Research on AI Delegation and Long-Horizon Reliability

Our recent paper, “LLMs Corrupt Your Documents When You Delegate”, has generated discussion about the reliability of AI systems in delegated workflows. We appreciate the interest in this work and want to clarify several important points about what the paper does—and does not—clai…

GitHub · 5 days ago

Building a general-purpose accessibility agent—and what we learned in the process

Learn about the experimental general-purpose accessibility agent that GitHub is piloting.

AWS ML · 5 days ago

Restrict access to sensitive documents in your Amazon Quick knowledge bases for Amazon S3

In this post, we walk through how to configure document-level ACLs for your S3 knowledge base in Amazon Quick. You will learn how to set up and verify an ACL configuration that enforces document-level permissions across chat and automated workflows.

GitHub · 5 days ago

Raising the bar: Quality, shared responsibility, and the future of GitHub’s bug bounty program

We're updating our bug bounty program standards to prioritize quality submissions, clarify shared responsibility boundaries, and evolve how we reward low-risk findings.

Amazon Science · 5 days ago

Making LLMs faster without sacrificing accuracy

A new scaling law that relates particular architectural choices to loss helps identify models that improve throughput by up to 47% with no loss of accuracy.

OpenAI · 6 days ago

Databricks brings GPT-5.5 to enterprise agent workflows

Databricks uses GPT-5.5 for enterprise agent workflows after the model set a new state of the art on the OfficeQA Pro benchmark.

OpenAI · 6 days ago

How sales teams use Codex

See how sales teams can use Codex to create pipeline briefs, meeting prep packets, forecast reviews, account plans, and stalled-deal diagnoses from real work inputs.

OpenAI · 6 days ago

How data science teams use Codex

See how data science teams can use Codex to build root-cause briefs, impact readouts, KPI memos, scoped analyses, and dashboard specs from real work inputs.

OpenAI · 6 days ago

How business operations teams use Codex

See how business operations teams can use Codex to create initiative briefs, strategy updates, leadership decision packets, progress updates, and more from real work inputs.

OpenAI · 6 days ago

A new personal finance experience in ChatGPT

Preview a new personal finance experience in ChatGPT for Pro users in the U.S. Securely connect your financial accounts and get AI-powered insights and guidance grounded in your financial context, goals, and priorities.

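Meituan · 6 days ago

Meituan LongCat open-sources General 365: setting a new benchmark for reasoning evaluation

The Meituan LongCat team has officially released General 365. In our tests of 26 mainstream models, Gemini 3 Pro, currently the strongest model available, reached an accuracy of only 62.8%, and the vast majority of models did not even reach the passing mark of 60.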

May 14 (Thu) · 2026-05-14 · 12 posts

AWS · 6 days ago

Amazon Bedrock introduces new advanced prompt optimization and migration tool

Amazon Bedrock Advanced Prompt Optimization enables customers to optimize their prompts for their current model or migrate prompts to new models faster than before with built-in evaluation feedback loops. Optimize your prompts and compare results for up to 5 models simultaneously…

GitHub · 6 days ago

GitHub availability report: April 2026

In April, we experienced 10 incidents that resulted in degraded performance across GitHub services.

OpenAI · 6 days ago

Sea's View on the Future of Agentic Software Development with Codex

Sea Limited's CPO explains why the company is deploying Codex across engineering teams to accelerate AI-native software development in Asia.

Hugging Face · 6 days ago

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

AWS ML · 6 days ago

Improve bot accuracy with Amazon Lex Assisted NLU

In this post, you will learn how to implement Assisted NLU effectively. You will learn how to improve your bot design with effective intent and slot descriptions, validate your implementation using Test Workbench, and plan your transition from traditional NLU to Assisted NLU for…

AWS ML · 6 days ago

Real-time voice agents with Stream Vision Agents and Amazon Nova 2 Sonic

In this post, you learn how to combine Stream's Vision Agents open-source framework with Amazon Bedrock and Amazon Nova 2 Sonic to build real-time voice agents that can be production-ready in minutes. You'll learn how the integration works under the hood, walk through code exampl…

AWS ML · 6 days ago

From siloed data to unified insights: Cross-account Athena Access for Amazon Quick

Today, we're announcing cross-account Athena access for Amazon Quick. With this feature, customers can query Athena data in other AWS accounts using AWS Identity and Access Management (IAM) role chaining, with query costs billed to the account where the data resides.

AWS ML · 6 days ago

Control where your AI agents can browse with Chrome enterprise policies on Amazon Bedrock AgentCore

In this post, you will configure Chrome enterprise policies to restrict a browser agent to a specific website, observe the policy enforcement through session recording, and demonstrate custom root CA certificates using a public test site. The walkthrough produces a working soluti…

GitHub · 6 days ago

From latency to instant: Modernizing GitHub Issues navigation performance

How the GitHub Issues team used client-side caching, smart prefetching, and service workers to make navigation feel instant.

Amazon Science · 6 days ago

Promptimus: Improving already good LLM prompts with zero manual engineering

By focusing on specific failure points and suggesting targeted solutions, a new automated prompt-engineering framework improves prompt performance without compromising existing functionality.

OpenAI · 6 days ago

Work with Codex from anywhere

Use Codex anywhere with the ChatGPT mobile app. Monitor, steer, and approve coding tasks in real time across devices and remote environments.

Cloudflare · 6 days ago

Our billing pipeline was suddenly slow. The culprit was a hidden bottleneck in ClickHouse

When a partitioning change to our petabyte-scale ClickHouse cluster caused critical billing jobs to stall, standard metrics showed no obvious errors. This post explores how we identified severe lock contention in ClickHouse's query planner and built upstream patches to fix it.