上海交通大学毕业设计 · 2025 · 核心组员
Shanghai Jiao Tong University · Capstone · 2025 · Core team member
Demo
毕业设计小组(Group 21)项目演示视频(YouTube)。
Capstone team (Group 21) demo on YouTube.
Context
新能源汽车普及提速,全球最早一批动力电池已濒临报废,退役电池的安全高效处置成为亟待解决的行业问题。现有的两种主流方案均存在明显局限:
As EV adoption accelerates, early battery packs are reaching end-of-life. Safe, efficient teardown of retired packs is an urgent industry challenge. The two dominant approaches both have clear limits:
Architecture
Ownership
市场上厂商仅提供电池安装说明书,拆解流程缺乏标准化文档——这正是引入大模型的核心原因:借助其推理能力,从安装逻辑逆向推导拆解步骤,并预判潜在危险情境。基于此,系统对比评估多款多模态大模型,最终选定 Gemini 2.0 Flash 作为任务规划核心模型。
OEMs ship installation manuals only—no standardized teardown docs. That is why we use an LLM: to infer disassembly steps from install logic and anticipate risky situations. We benchmarked multimodal models and chose Gemini 2.0 Flash as the planning backbone.
| 模型 | 任务准确性 | 任务覆盖程度 | 危险情境预判 | 工业术语理解 | 多模态能力 | 成本 |
|---|---|---|---|---|---|---|
| GPT-4o | ✓ 高 | ✓ 完整 | ✓ 强 | ✓ 强 | ✓ 强 | 高 |
| Qwen 系列 | △ 中等 | △ 部分 | △ 一般 | ✓ 强(中文) | △ 部分 | 低 |
| Gemini 1.5 系列 | ✓ 高 | △ 较完整 | △ 一般 | ✓ 强 | ✓ 强 | △ 中等 |
| Gemini 2.0 Flash ✓ | ✓ 高 | ✓ 完整 | ✓ 强 | ✓ 强 | ✓ 强 | 低 |
| Model | Accuracy | Coverage | Risk foresight | Industrial terms | Multimodal | Cost |
|---|---|---|---|---|---|---|
| GPT-4o | ✓ High | ✓ Full | ✓ Strong | ✓ Strong | ✓ Strong | High |
| Qwen | △ Medium | △ Partial | △ Fair | ✓ Strong (ZH) | △ Partial | Low |
| Gemini 1.5 | ✓ High | △ Mostly full | △ Fair | ✓ Strong | ✓ Strong | △ Medium |
| Gemini 2.0 Flash ✓ | ✓ High | ✓ Full | ✓ Strong | ✓ Strong | ✓ Strong | Low |
针对 BERT 和 n-gram 在检测任务重叠与冗余时精度不足的问题,设计并实现了基于 DiffCSE + Cross-Encoder 的双阶段语义验证流程。
BERT and n-gram similarity were too weak for overlap and redundancy in industrial steps; we implemented a DiffCSE + Cross-Encoder two-stage pipeline.
协助团队完成各模块的集成对接与联调测试,确保系统整体流程的稳定运行。
Helped integrate modules and run joint tests so the end-to-end pipeline stayed stable.
Outcome