Competitor New Capabilities Weekly Report

🔷 SiliconFlow (硅基流动)

Updated

📄 https://docs.siliconflow.cn/cn/release-notes/overview · 2026-06-25T14:05:00+08:00

Update: GLM-4.7, Kimi-K2.5, and several other models will be taken offline soon

Update: 2026-05-15 multi-model deprecation notice + stricter real-name verification requirements

⚡ Infinigence AI (无问芯穹)

Updated

📄 https://docs.infini-ai.com/gen-studio/changelog.html · 2026-06-25T14:08:00+08:00

Update: 14 models, including the Qwen3 series, Kimi-K2-Thinking, and DeepSeek-V4-Flash-Llama, will be taken offline on June 30

Update: 2026-06-18 Qwen3.6 series models launched / 2026-06-17 GLM-5.2 model launched

Update: 2026-06-16 budget management feature launched; new error code 10024 added

🌐 Alibaba Cloud Bailian (阿里百炼)

Updated

📄 https://bailian.console.aliyun.com/ · 2026-06-25T14:10:00+08:00

Update: 2026-06-09 model evaluation leaderboard and comprehensive evaluation feature launched

Update: model import to the international site, temporary API keys, asynchronous Responses API calls, Tongyi Tingwu Agent ASR resource packs, Spring AI Alibaba

🟢 PPIO

Updated

📄 https://blog.ppio.com/ · 2026-06-25T14:12:00+08:00

Update: first to launch GLM-5.2 and Kimi K2.7 Code; MiniMax M3 launched

Update: named to 36Kr's 2026 Top 100 Most Valuable Growth Companies, CAICT's Token Climbing Program, and QbitAI's 2026 AIGC Enterprise Awards

🌋 Volcano Engine Ark (火山方舟)

Updated

📄 https://www.volcengine.com/docs/82379/1159177 · 2026-06-25T14:15:00+08:00

Update: Doubao-Seed-2.0 (256k context), Doubao-Seedance-2.0 (video generation), Doubao-Seedream-5.0 (image generation)

Update: Ark Coding Plan subscription service, Responses API, MCP protocol support for the Ark documentation

🎆 Fireworks

Updated

📄 https://docs.fireworks.ai/updates/changelog · 2026-06-25T14:10:00+08:00

Update: Serverless retires Kimi K2.5, Qwen 3.6 Plus, and MiniMax M2.5; Audio and Image models deprecated

Update: GLM 5.2, Kimi K2.7 Code, MiniMax M3, and Qwen 3.7 Plus launched

Update: Video & Audio Models, AWS S3 Training, JIT SSO

🎯 Model customization capabilities

Fine-tuning: Warm-Start RFT

Supports continuing RFT training from an SFT checkpoint for a seamless SFT-to-RFT workflow; the checkpoint is specified with the --warm-start-from flag.

Tags: RFT, SFT, warm-start, checkpoint

https://docs.fireworks.ai/fine-tuning/warm-start

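Fine-tuning: VLM multimodal fine-tuning

Supports multimodal fine-tuning for the Qwen 2.5 VL model family (3B/7B/32B/72B), using image-text data in JSONL format, with training at up to 64K context.

Tags: VLM, multimodal, Qwen2.5-VL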

https://docs.fireworks.ai/fine-tuning/fine-tuning-vlm

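Fine-tuning: Fine-tuning job management

Fine-tuning jobs can be resumed after being stopped, and new jobs can be cloned from an existing configuration. This applies to both SFT and RFT jobs.

Tags: SFT, RFT, job management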

https://docs.fireworks.ai/fine-tuning/fine-tuning-models

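Fine-tuning: RFT with DPO

Supports DPO (Direct Preference Optimization) and reinforcement fine-tuning, with custom rollout and reward development and support for multi-turn training.

Tags: RFT, DPO, RLHF, preference optimization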

https://docs.fireworks.ai/fine-tuning/reinforcement-fine-tuning-models

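Fine-tuning: Weighted training

RFT supports per-sample weighted training, which controls how much each sample influences training.

Tags: weighted training, sample weights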

https://docs.fireworks.ai/fine-tuning/fine-tuning-models
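
As a rough illustration of what per-sample weighting means, the sketch below computes a weighted average of per-sample losses. It is not Fireworks' implementation: the function name `weighted_mean_loss` and the choice to normalize by the sum of weights are assumptions made for this example only.

```python
from typing import Sequence


def weighted_mean_loss(losses: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted average of per-sample losses (hypothetical helper, not a vendor API)."""
    if len(losses) != len(weights):
        raise ValueError("losses and weights must have the same length")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("at least one weight must be positive")
    # Samples with larger weights contribute proportionally more to the result.
    return sum(loss * w for loss, w in zip(losses, weights)) / total_weight
```

With equal weights this reduces to the plain mean; raising one sample's weight pulls the result toward that sample's loss.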

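Fine-tuning: QAT (quantization-aware training)

SFT V2 supports quantization-aware training, which accounts for quantization effects during training to improve results in low-precision deployment.

Tags: QAT, quantization-aware training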

https://docs.fireworks.ai/fine-tuning/fine-tuning-models

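Evaluation: Evaluator & MLOps

Improved Evaluator creation experience with GitHub template integration; new Weights & Biases and MLflow integrations for experiment tracking.

Tags: Evaluator, W&B, MLflow, experiment tracking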

https://docs.fireworks.ai/ecosystem/integrations/wandb

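Data: AWS S3 BYOB datasets

Training datasets can be stored in your own AWS S3 bucket and accessed securely through GCP-to-AWS OIDC federation, with no long-lived credentials required.

Tags: S3, BYOB, datasets, secure training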

https://docs.fireworks.ai/fine-tuning/secure-fine-tuning

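Data: Dataset download

Datasets can be downloaded directly from the Fireworks Web App, either as individual files or as a bulk ZIP.

Tags: dataset download, data management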

https://docs.fireworks.ai/fine-tuning/fine-tuning-models

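Deployment: LoRA quick deployment

Supports live-merge or multi-LoRA deployment, with speed matching the base model and a single command to simplify the deployment flow.

Tags: LoRA, live merge, multi-LoRA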

https://docs.fireworks.ai/fine-tuning/deploying-loras

🤝 Together AI

Updated

📄 https://docs.together.ai/docs/changelog · 2026-06-25T14:18:00+08:00

Update: Whoami API endpoint, early stopping for fine-tuning, audio transcription upload limit raised to 500MB, LoRA adapters can be attached to dedicated endpoints

Update: GLM-5.2, Kimi K2.7 Code, and MiniMax M3 launched on serverless; deprecation notice for GLM-5 and Qwen3.5-397B-A17B

Update: DeepSeek-V4-Pro price cut

🎯 Model customization capabilities

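Fine-tuning: Early stopping

Fine-tuning jobs support early stopping: training stops automatically when validation loss stops improving, which reduces cost and avoids overfitting. Tunable parameters include patience and min_delta.

Tags: early stopping, overfitting, validation loss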

https://docs.together.ai/docs/fine-tuning/early-stopping
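
To make the roles of patience and min_delta concrete, here is a minimal sketch of the usual early-stopping rule. It assumes the common definition (stop after `patience` consecutive evaluations that fail to beat the best validation loss by more than `min_delta`); Together AI's exact semantics may differ, and the function name `early_stop_step` is hypothetical.

```python
from typing import Optional, Sequence


def early_stop_step(
    val_losses: Sequence[float], patience: int, min_delta: float = 0.0
) -> Optional[int]:
    """Return the index of the evaluation at which training would stop, or None.

    Assumed rule: an evaluation counts as an improvement only if it beats the
    best loss so far by more than min_delta.
    """
    best = float("inf")
    stale = 0  # consecutive evaluations without a sufficient improvement
    for i, loss in enumerate(val_losses):
        if loss < best - min_delta:
            best = loss
            stale = 0
        else:
            stale += 1
            if stale >= patience:
                return i
    return None
```

A larger min_delta treats small fluctuations as no improvement, so training stops sooner; a larger patience tolerates longer plateaus.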

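Fine-tuning: Metrics API

New GET /fine-tunes/{id}/metrics endpoint for programmatic access to training metrics (loss curves, etc.), so progress can be monitored without opening the dashboard.

Tags: Metrics API, monitoring, loss curve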

https://docs.together.ai/docs/fine-tuning/metrics-api

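Fine-tuning: New fine-tunable models

Adds fine-tuning support for Qwen3.6-35B-A3B and Gemma-4-31B/26B, extending large-model fine-tuning coverage.

Tags: Qwen3.6, Gemma-4, fine-tuning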

https://docs.together.ai/docs/fine-tuning/supported-models

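Fine-tuning: FP4/INT8 quantization

Supports FP4 and INT8 quantized deployment to lower inference cost; several models (such as GLM-5.2 and Gemma-4) already support FP4 quantization.

Tags: FP4, INT8, quantized deployment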

https://docs.together.ai/docs/inference/quantization

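Evaluation: Evaluations framework

An LLM-as-judge evaluation framework with compare, classify, and score functions; custom evaluation suites can be created.

Tags: LLM-as-judge, evaluation framework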

https://docs.together.ai/docs/evaluation/overview

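Evaluation: Single-pass comparison

compare evaluations support the disable_position_bias_correction parameter; running a single pass cuts cost and latency by 50%.

Tags: compare evaluation, position bias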

https://docs.together.ai/docs/evaluation/compare
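
The 50% figure is easier to see with a sketch of how position-bias correction is commonly done: the judge is called twice with the answer order swapped, and a winner is declared only if both passes agree. The code below is an assumption-based illustration, not Together AI's implementation; the `judge` callable and the `correct_position_bias` flag are hypothetical stand-ins for the real judge model and the disable_position_bias_correction parameter.

```python
from typing import Callable

# Hypothetical judge: given two answers in order, returns "first" or "second".
Judge = Callable[[str, str], str]


def compare(a: str, b: str, judge: Judge, correct_position_bias: bool = True) -> str:
    """Return "A", "B", or "tie" for answers a and b."""
    # Pass 1: a is shown first.
    first_pass = "A" if judge(a, b) == "first" else "B"
    if not correct_position_bias:
        return first_pass  # single pass: one judge call instead of two
    # Pass 2: same pair with the order swapped.
    second_pass = "B" if judge(b, a) == "first" else "A"
    # Disagreement between passes suggests the verdict depended on position.
    return first_pass if first_pass == second_pass else "tie"
```

A judge that always favors whichever answer it sees first yields "tie" with correction enabled, but "A" in single-pass mode, which is the trade-off behind the cheaper setting.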

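Data: Dataset validation

Uploaded fine-tuning datasets go through full server-side schema validation, which exposes processing_status and validation_report so that problems such as missing role fields can be caught before training.

Tags: data validation, schema validation, quality checks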

https://docs.together.ai/docs/fine-tuning/dataset-validation
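
A similar check can be approximated locally before upload. The sketch below assumes a chat-style JSONL layout in which each line is an object with a `messages` list of `{"role", "content"}` entries; that layout, the function name `find_missing_roles`, and the message wording are assumptions for illustration and do not reproduce the server-side validation_report format.

```python
import json
from typing import List


def find_missing_roles(jsonl_text: str) -> List[str]:
    """Return human-readable problems found in chat-style JSONL text (assumed schema)."""
    problems: List[str] = []
    for line_no, line in enumerate(jsonl_text.splitlines(), start=1):
        if not line.strip():
            continue  # ignore blank lines
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            problems.append(f"line {line_no}: invalid JSON")
            continue
        messages = record.get("messages") if isinstance(record, dict) else None
        if not isinstance(messages, list):
            problems.append(f"line {line_no}: missing 'messages' list")
            continue
        for idx, message in enumerate(messages):
            if not isinstance(message, dict) or "role" not in message:
                problems.append(f"line {line_no}, message {idx}: missing 'role'")
    return problems
```

An empty result means no structural problems were found under the assumed schema; it does not guarantee that the server-side validation will pass.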

Data: HuggingFace integration

Supports fine-tuning any Hugging Face Hub CausalLM model with fewer than 100B parameters.

Tags: HuggingFace, datasets, integration

https://docs.together.ai/docs/fine-tuning/huggingface

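Deployment: LoRA multi-adapter

Multiple LoRA adapters can be attached to a single dedicated LoRA endpoint, sharing hardware resources.

Tags: LoRA, multi-adapter, hardware sharing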

https://docs.together.ai/docs/fine-tuning/lora-multi-adapter

☁️ Google Vertex AI

Updated

📄 https://cloud.google.com/vertex-ai/docs/release-notes · 2026-06-25T14:22:00+08:00

Update: Data Science Agent officially released; Vertex AI Extensions deprecated (shutdown on 2026-11-26)

Update: Gemma 4 26B A4B IT and Veo 3.1 Lite in public preview

Update: Imagen and video generation GA endpoints deprecated (migration recommended before June 30); Gemini 2.5 series retirement date updated to 2026-10-16

⚙️ Novita AI

Updated

📄 https://docs.novita.ai/ · 2026-06-25T14:28:00+08:00

Update: GLM Text to Speech, GLM Audio to Text, GLM Voice Clone, GLM Image Generation, and Hunyuan Image 3 will be taken offline on July 7

Update: recommended replacements are MiniMax Speech 2.8 Turbo, MiniMax Quick Voice Cloning, and Seedream 5.0 lite

📦 AWS Bedrock

Updated

📄 https://aws.amazon.com/bedrock/ · 2026-06-25T14:30:00+08:00

Update: 2026-06-01 GPT-5.5/GPT-5.4/Codex GA / 2026-05-28 Claude Opus 4.8 launched

Update: AMI Watermarks, live configuration updates for EMR Serverless, cross-account access for AWS MCP Server, Bedrock Service Quotas expansion

🧠 AWS SageMaker AI

Updated

📄 https://aws.amazon.com/sagemaker/ · 2026-06-25T14:32:00+08:00

Update: expanded GPU options for SageMaker Notebook instances (P6-B200, P5en, P4de)

Update: Unified Studio notebook scheduling, Multi-turn RL for agent customization, ECS Managed Instances support for Trainium/Inferentia, Studio quick setup (under 20 seconds)

🔀 OpenRouter

Updated

📄 https://openrouter.ai/docs · 2026-06-25T14:35:00+08:00

Update: AI Governance Checklist and AI Data Residency guide published

Update: integration guides published for OpenClaw, SillyTavern, Kilo Code, OpenAI Codex CLI, and Claude Code

Update: OpenRouter vs Portkey and OpenRouter vs LiteLLM comparison guides