- Updated: January 5, 2026
- 6 min read
Tencent Unveils HY‑MT1.5: Powerful Multilingual Translation Models for Edge and Cloud
Tencent’s HY‑MT1.5 is a breakthrough multilingual machine‑translation family that ships two open‑source models—HY‑MT1.5‑1.8B and HY‑MT1.5‑7B—optimized for both on‑device (edge) and cloud deployments, supporting 33 languages and five dialects with state‑of‑the‑art accuracy, ultra‑low latency, and flexible quantization.
Why HY‑MT1.5 matters for AI developers and businesses
When Tencent’s Hunyuan research team unveiled HY‑MT1.5, the AI community got a rare blend of raw performance and deployment versatility. The announcement, originally reported by MarkTechPost, highlighted a model family that can run on a smartphone with just 1 GB of RAM while still rivaling the quality of multi‑billion‑parameter cloud services.
For developers building multilingual products—whether it’s a chatbot, a content‑generation pipeline, or a real‑time translation feature—HY‑MT1.5 offers a single training recipe that scales from edge to enterprise, dramatically reducing engineering overhead.

Model lineup: 1.8 B vs. 7 B
HY‑MT1.5 comes in two sizes, each targeting a distinct deployment scenario:
- HY‑MT1.5‑1.8B: A compact, distilled model with 1.8 billion parameters—roughly one‑third the size of its larger sibling. After FP8 or Int4 quantization it fits into ~1 GB memory, delivering sub‑200 ms latency on typical mobile CPUs.
- HY‑MT1.5‑7B: The flagship 7 billion‑parameter version, built on the award‑winning WMT25 champion Hunyuan‑MT‑7B. It excels in complex, mixed‑language contexts and is best suited for server‑grade or high‑end edge hardware where latency up to ~450 ms is acceptable for premium quality.
Both models support bidirectional translation across 33 languages and five dialects, and they expose three production‑ready prompt templates: terminology intervention, context‑aware translation, and format‑preserving translation.
Technical innovations that set HY‑MT1.5 apart
1. Multi‑stage training pipeline
The HY‑MT1.5 family follows a five‑phase pipeline designed exclusively for translation:
- General pre‑training: Large‑scale multilingual language modeling builds shared cross‑lingual representations.
- MT‑oriented pre‑training: Parallel corpora and translation‑specific objectives align the model with real‑world translation dynamics.
- Supervised fine‑tuning: High‑quality sentence‑ and document‑level pairs sharpen literal accuracy and domain coverage.
- On‑policy distillation: The 7 B teacher guides the 1.8 B student using reverse KL divergence on 1 M monolingual prompts, preserving most of the teacher’s behavior at a fraction of the cost.
- Reinforcement learning with rubric‑based rewards: Human‑rated rubrics (accuracy, fluency, idiomaticity, cultural appropriateness) feed a reward model that fine‑tunes both variants for real‑world quality.
2. Quantization for edge deployment
HY‑MT1.5‑1.8B ships with post‑training FP8 and Int4 quantized checkpoints (via GPTQ). FP8 retains >99% of the full‑precision XCOMET‑XXL scores, while Int4 reduces the footprint further at a modest quality trade‑off—still far ahead of most commercial APIs on the same hardware.
3. Prompt‑driven production features
- Terminology intervention: Inject domain‑specific term mappings (e.g., “混元珠 → Chaos Pearl”) to guarantee consistent branding.
- Context‑aware translation: Provide surrounding paragraphs so ambiguous words like “pilot” are interpreted correctly.
- Format‑preserving translation: Wrap source text in
<source>tags and receive output in<target>tags, preserving HTML/XML structure.
Benchmark performance: How HY‑MT1.5 stacks up
Tencent evaluated the models on three leading benchmarks—Flores‑200, WMT25, and a Mandarin‑to‑minority‑language suite—using XCOMET‑XXL and COMET‑Kiwi metrics.
| Benchmark | Model | Score (XCOMET‑XXL) | Latency (ms) |
|---|---|---|---|
| Flores‑200 (ZH→XX) | HY‑MT1.5‑7B | 0.8690 | ≈450 |
| Flores‑200 (EN→XX) | HY‑MT1.5‑7B | 0.9093 | ≈450 |
| WMT25 | HY‑MT1.5‑7B | 0.6159 | ≈450 |
| WMT25 | HY‑MT1.5‑1.8B | 0.5308 | ≈180 |
| Mandarin→Minority (XCOMET‑XXL) | HY‑MT1.5‑7B | 0.6174 | ≈450 |
| Mandarin→Minority (XCOMET‑XXL) | HY‑MT1.5‑1.8B | 0.5806 | ≈180 |
On human evaluation (0‑4 scale) the 1.8 B variant scored 2.74, surpassing Google, Microsoft, Baidu, and iFLYTEK translators in the same test conditions.
Real‑world use‑cases where HY‑MT1.5 shines
Because the models are both high‑quality and lightweight, they fit a wide spectrum of products:
- Mobile translation apps: Deploy the 1.8 B FP8 model on Android/iOS for offline, real‑time chat translation.
- Customer‑support chatbots: Combine ChatGPT and Telegram integration with HY‑MT1.5 to provide multilingual assistance directly inside Telegram.
- Content localization pipelines: Use the format‑preserving prompt to translate HTML newsletters while keeping tags intact.
- Enterprise knowledge bases: Integrate with the Enterprise AI platform by UBOS to auto‑translate internal documents across global teams.
- AI‑enhanced marketing: Pair with multilingual AI models for localized ad copy generation.
Developers can also embed the models in Web app editor on UBOS to prototype translation‑aware SaaS products without writing low‑level inference code.
How to get started with HY‑MT1.5
Both models are released under an open‑weight license on GitHub and Hugging Face. The repository includes:
- Full‑precision checkpoints.
- FP8 and Int4 quantized binaries ready for
ggufruntime. - Prompt templates for terminology, context, and format preservation.
- Evaluation scripts reproducing the Flores‑200 and WMT25 results.
To experiment quickly, you can spin up a free instance on the UBOS platform overview and import the model directly from Hugging Face. The platform’s Workflow automation studio lets you chain translation with downstream tasks such as sentiment analysis or voice synthesis.
For startups looking for a turnkey solution, the UBOS for startups program offers credits and dedicated support to integrate HY‑MT1.5 into MVPs.
SMBs can benefit from the UBOS solutions for SMBs, which bundle the model with UI components, monitoring dashboards, and auto‑scaling.
Boosting your translation workflow with UBOS templates
UBOS’s marketplace hosts ready‑made templates that pair perfectly with HY‑MT1.5:
- AI SEO Analyzer – run multilingual SEO audits on translated pages.
- AI Article Copywriter – generate localized blog drafts, then polish with HY‑MT1.5.
- AI Video Generator – create subtitles in dozens of languages using the 7 B model.
- AI Chatbot template – plug HY‑MT1.5 for multilingual conversational agents.
- AI LinkedIn Post Optimization – translate and optimize professional posts for global audiences.
These templates are fully compatible with the Chroma DB integration, enabling fast vector search over translated content.
Cost, licensing, and partnership options
While the model weights are free, production‑grade hosting and support are covered under UBOS’s flexible pricing. Review the UBOS pricing plans to choose between pay‑as‑you‑go or enterprise‑level contracts.
Developers and system integrators can also join the UBOS partner program to receive co‑marketing, technical enablement, and revenue‑share incentives for building translation‑centric solutions.
Bottom line: HY‑MT1.5 redefines multilingual AI for every scale
Tencent’s HY‑MT1.5 delivers a rare combination of open‑source accessibility, edge‑ready quantization, and benchmark‑leading quality. Whether you are a startup prototyping a multilingual chat app, an SMB needing reliable document translation, or an enterprise building a global knowledge platform, the model family fits seamlessly into the UBOS homepage ecosystem.
By leveraging the multilingual AI models already curated on UBOS, you can accelerate time‑to‑market, cut API costs, and retain full control over data privacy—critical factors for any AI‑driven product in 2026.
Stay ahead of the curve: explore the open‑source repo, test the quantized checkpoints on your device, and combine them with UBOS’s low‑code tools to unleash truly universal language experiences.