Fine-tuning DeepSeek-R1-32B on a single RTX 4090


Source: mp.weixin.qq.com

Summary

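On a single RTX 4090 with 24 GB of VRAM, the author used unsloth with quantized LoRA fine-tuning to fine-tune the 62 GB deepseek-ai/DeepSeek-R1-Distill-Qwen-32B model. Because LoRA trains only small low-rank adapter matrices while the base weights stay frozen, this is parameter-efficient fine-tuning rather than full-parameter fine-tuning. The training set contained 24,772 examples, and the run took 9,288 steps and 28 hours 28 minutes 37 seconds. These optimizations substantially reduced VRAM usage and improved training efficiency.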
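The figures in the summary can be sanity-checked with a little arithmetic. The sketch below is only a rough estimate, not the author's own calculation. It assumes that the 62 GB checkpoint stores weights in 16 bits, that the quantized base model uses 4 bits per weight, and that quantization metadata, adapters, activations, and optimizer state are ignored. The function names and the effective batch size of 8 are hypothetical and do not come from the source.

```python
# Figures quoted in the summary.
NUM_SAMPLES = 24_772
TOTAL_STEPS = 9_288
WALL_CLOCK_SECONDS = 28 * 3600 + 28 * 60 + 37  # 28 h 28 min 37 s
CHECKPOINT_GB = 62.0
GPU_MEMORY_GB = 24.0


def seconds_per_step(total_seconds: float, steps: int) -> float:
    """Average wall-clock time spent on one optimizer step."""
    return total_seconds / steps


def quantized_weight_gb(
    checkpoint_gb: float, source_bits: int = 16, target_bits: int = 4
) -> float:
    """Approximate weight size after quantization.

    ASSUMPTION: the checkpoint stores `source_bits` per weight and the
    quantized model stores `target_bits`; scaling factors and other
    quantization metadata are ignored.
    """
    return checkpoint_gb * target_bits / source_bits


def implied_epochs(steps: int, effective_batch_size: int, num_samples: int) -> float:
    """Passes over the data implied by a given effective batch size.

    ASSUMPTION: `effective_batch_size` is hypothetical; the source does
    not state the batch size or the number of epochs.
    """
    return steps * effective_batch_size / num_samples


if __name__ == "__main__":
    step_time = seconds_per_step(WALL_CLOCK_SECONDS, TOTAL_STEPS)
    weights_gb = quantized_weight_gb(CHECKPOINT_GB)
    print(f"seconds per step: {step_time:.2f}")
    print(f"estimated 4-bit weights: {weights_gb:.1f} GB of {GPU_MEMORY_GB:.0f} GB")
    print(f"epochs if effective batch size were 8: "
          f"{implied_epochs(TOTAL_STEPS, 8, NUM_SAMPLES):.2f}")
```

Under these assumptions the run averages about 11 seconds per step, and the 4-bit weights come to roughly a quarter of the 62 GB checkpoint. That is small enough to fit in 24 GB with headroom for adapters and activations, whereas the 16-bit checkpoint is not. The step count would also be consistent with about three passes over the data at an effective batch size of 8, but that batch size is a guess.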


Shared by xiaozi on 2025-02-14


Related topics: #DeepSeek #unsloth #Fine-tuning
