Loading Heatmap…

deng synced commits to dev at deng/mindformers from mirror

  • d7d265a0d3 !5711 【dev】【bug-fix】添加环境变量排除protobuf版本对vlmevalkit评测的影响;统一评测参数格式;去掉config_path参数 Merge pull request !5711 from zhouxq/vlmevalkit-bugfix
  • e15cbe876c !4773 【bugfix】优化run_check版本检测逻辑并更新版本信息 Merge pull request !4773 from yiyison/dev
  • c573875f70 run_ckeck版本信息更新
  • 4a356c2f57 【dev】【bug-fix】添加环境变量排除protobuf版本对vlmevalkit评测的影响;统一评测参数格式;减少传入config_path参数
  • Compare 4 commits »

10 hours ago

deng synced commits to dev at deng/mindformers from mirror

  • 910a8c16f0 !5742 【dev】【bugfix】casual_language数据集打断ms流水修复 Merge pull request !5742 from 魏琢艺/causal_repair
  • 7ccce0399b casual repair
  • 86c6e9f409 !5708 【dev】[docs] Delete index.rst of old documents Merge pull request !5708 from Xinrui Chen/code_docs-dev-del-index
  • b79e05a28f [docs] Delete index.rst of old documents
  • Compare 4 commits »

13 hours ago

deng synced commits to br_infer_deepseek_os at deng/mindformers from mirror

  • 1d3b9a5a2f !5741 support-BSH in pynative mode Merge pull request !5741 from yyyyrf/pynative-suuport-BSH
  • 6868454913 support-BSH in pynative mode
  • d46beb5be7 !5710 text generator support position_ids, q_seq_lens, attention_mask input Merge pull request !5710 from tan-wei-cheng/develop-twc-br_infer_deepseek_os2
  • b77ea6ca67 deepseek and qwen infer support position_ids, q_seq_lens, attention_mask input
  • Compare 4 commits »

13 hours ago

deng synced commits to dev at deng/mindformers from mirror

  • 17b5b5eb51 !5737 【dev】[docs] Replace unavailable link to FAQ Merge pull request !5737 from Xinrui Chen/code_docs_faq
  • e8ee265a6c !5731 【dev】【bugfix】Adapting mindspore set_deterministic and Add build_mf_con… Merge pull request !5731 from Jingwei Huang/dev
  • ef4aff707c !5734 【dev】mstx规避对新接口的不适配 Merge pull request !5734 from 魏琢艺/new_profile
  • 89ce262bba [docs] Replace unavailable link to FAQ
  • 15e88688a8 mstx规避对新接口的不适配
  • Compare 6 commits »

1 day ago

deng synced commits to dev at deng/mindformers from mirror

  • aa1f73ae8e !5680 【dev】【bugfix】删除eval(content)逻辑 Merge pull request !5680 from zyw_hw/fix_eval
  • 7c5afcb655 !5418 【dev】【Bugfix】check_rule考虑deepseek的mtp_depth设置 Merge pull request !5418 from 魏琢艺/checkrule_update
  • 0ba971628e check_rule考虑deepseek的mtp_depth设置
  • c854d59218 !5726 限制开启seqpipe特性时,传入attention_mask不为None的场景 Merge pull request !5726 from lan/dev_llama
  • 05aab92cd5 !5723 【dev】【bugfix】日志遗漏信息修复 Merge pull request !5723 from 魏琢艺/log_repair
  • Compare 18 commits »

2 days ago

deng synced commits to br_infer_deepseek_os at deng/mindformers from mirror

  • 577a5f17df !5721 【dev】deepseek模型支持kvcache作为模型输入 Merge pull request !5721 from 0.0/deepseek_support_kvcache
  • 5f9a0103c1 add kvcache model inputs for deepseek v3
  • Compare 2 commits »

2 days ago

deng synced commits to dev at deng/mindformers from mirror

  • bf537b204d !5699 限制msrun单卡的场景,提示直接使用python Merge pull request !5699 from lan/dev_msrun
  • 2ddaa423ea !5694 【dev】【bugfix】冷热专家错别字修复 Merge pull request !5694 from 魏琢艺/coldhot_repair
  • cdb4656776 !5672 【bugfix】create parameter(mint.empty) for kvcache Merge pull request !5672 from zhangdanyang/0314_dev
  • db8691c879 !5713 qwen2_5lora转换脚本ut测试 Merge pull request !5713 from 祝建伟/lora_ut
  • 2a8583fa1f qwen2_5lora转换脚本ut测试
  • Compare 8 commits »

2 days ago

deng synced commits to dev at deng/mindformers from mirror

  • 9de8c1bb79 !5683 【dev】【bugfix】pickle.load接口加hash校验 Merge pull request !5683 from zyw_hw/fix_pickle_load
  • 3ce850f47c !5682 【dev】【bugfix】删除run_pretrain的在线下载数据集逻辑 Merge pull request !5682 from zyw_hw/fix_run_pretrain
  • e9aa948667 !5668 【dev】【bugfix】HF数据集Packing添加shuffle参数,添加packing config管理参数 Merge pull request !5668 from niujunhao/bugfix/add_hf_dataset_shuffle
  • 628f09f078 !5519 【dev】【bugfix】fix bug/implementation in moe for deepseek3 Merge pull request !5519 from Albert/dev
  • 8fb3e0f039 !5678 【dev】【bugfix】torch.load指定cpu上执行 Merge pull request !5678 from zyw_hw/code_docs_fix_torch_load
  • Compare 22 commits »

4 days ago

deng synced commits to br_infer_deepseek_os at deng/mindformers from mirror

  • 7aef69aa63 !5704 lll: modify deepseek r1 thread num and bind cpu config Merge pull request !5704 from 刘力力/br_infer_deepseek_os_all_config
  • 62e1db13b8 lll: modify deepseek r1 thread num and bind cpu config
  • Compare 2 commits »

4 days ago

deng synced commits to dev at deng/mindformers from mirror

  • aee1b32956 !5663 【dev】【bugfix】修复eod压缩场景不支持batch>1的问题 Merge pull request !5663 from 魏琢艺/causal_update
  • c697cc5274 !5690 过期失效链接删除、替换 Merge pull request !5690 from lan/code_docs
  • dd97f6acfc !5649 【dev】【bugfix】推理公共代码修改GMM算子调用逻辑以及router_dense_type可配 Merge pull request !5649 from Yule100/telechat_moe
  • b209ce3843 !5696 【dev】【bug-fix】Harness评测shell脚本--添加register_path参数,设置REGISTER_PATH环境变量;修复llama.py中set_dynamic_inputs和construct方法参数不一致的情况 Merge pull request !5696 from zhouxq/harness_dev
  • 8e367ce4d2 过期失效链接删除、替换
  • Compare 10 commits »

5 days ago

deng synced commits to dev at deng/mindformers from mirror

  • cdf9341a1a !5676 [dev][bugfix]Fix use_fused_swiglu for DeepSeekV3 when use_seq_parallel=False Merge pull request !5676 from liuluobin/dev_fix_dsv3_swiglu
  • e11249cd65 !5667 【dev】【feature】hf权重qkv_concat为False场景,权重加载不落盘 Merge pull request !5667 from fengtingyan/dev
  • 763579a140 [dev][bugfix]Fix use_fused_swiglu for DeepSeekV3 when use_seq_parallel=False
  • b658330941 【dev】【feature】hf权重qkv_concat为False场景,支持在线权重加载不落盘 update
  • Compare 4 commits »

5 days ago

deng synced commits to br_infer_deepseek_os at deng/mindformers from mirror

  • 3e577e770a !5688 deepseek v3/r1 and qwen chunked prefill and prefix caching improve performance Merge pull request !5688 from tan-wei-cheng/develop-twc-br_infer_deepseek_os2
  • a2cfa3cb61 deepseek v3/r1 and qwen chunked prefill and prefix caching improve performance
  • Compare 2 commits »

5 days ago

deng synced commits to dev at deng/mindformers from mirror

  • 74579f4a78 !5671 [dev] [bugfix] tensorboard写入内容未正确刷新 Merge pull request !5671 from zhangyihui/dev-tb
  • 3cce171abc !5639 【dev】mcore适配lookahead Merge pull request !5639 from wuzhiyuan1996/la
  • 7a5009b084 mcore适配lookahead
  • 4841df2d36 [dev] [bugfix] tensorboard写入内容未正确刷新
  • 19cbf92862 !5631 【dev】mcore适配prefix cache和split fuse Merge pull request !5631 from wuzhiyuan1996/prefix_cache
  • Compare 6 commits »

1 week ago

deng synced commits to br_infer_deepseek_os at deng/mindformers from mirror

  • c8012fe44b !5654 mf deepseek v3/r1 and qwen support chunked prefill and prefix caching Merge pull request !5654 from tan-wei-cheng/develop-twc-br_infer_deepseek_os2
  • cd9e46278d mf deepseek v3/r1 and qwen support chunked prefill and prefix caching
  • Compare 2 commits »

1 week ago

deng synced commits to dev at deng/mindformers from mirror

  • f0fea610e8 !5666 【dev】【bugfix】Modify the deterministic computing setting in context. Merge pull request !5666 from Jingwei Huang/dev
  • 2a5ca24e39 !5595 【bug-fix】解决部分harness易用性问题 Merge pull request !5595 from zhouxq/harness_dev
  • 36319d0a65 【dev】【bugfix】Modify the deterministic computing setting in context.
  • 931a0fb409 !5601 【bugfix】【dev】expert_model_parallel 增加值校验 Merge pull request !5601 from 黄勇/bugfix_moe_mp
  • cb955cd407 !5556 【dev】优化monitor_config使用 Merge pull request !5556 from 魏琢艺/monitor_config_update
  • Compare 8 commits »

1 week ago

deng synced commits to dev at deng/mindformers from mirror

  • 0f21bed3e1 !5664 【dev】[datasets] data handler support auto register Merge pull request !5664 from Xinrui Chen/dev-data-handler
  • 65f7e3c6a0 !5633 reuse of sendlist and receivelist, add swiglu in moe Merge pull request !5633 from liuyanwei/ata_swiglu
  • 2ea1ccf253 [datasets] data handler support auto register
  • 129f4459b0 !5646 【bug-fix】修改HF转MS权重函数位置&修复多机多卡并行删除策略文件问题 Merge pull request !5646 from zhouxq/hf_covert_mf
  • e24b100e54 add d2h of sendlist and receivelist and swiglu in moe
  • Compare 6 commits »

1 week ago

deng synced commits to dev at deng/mindformers from mirror

  • e8d2fec3c0 !5661 [dev][feature]support mp * cp_ds > n_kv_head in GLM Merge pull request !5661 from ZhihaoLi/dev
  • 0dd8f2915a support mp * cp_ds > n_kv_head in GLM
  • e55001f984 !5588 【ci】update ms version 0312 Merge pull request !5588 from niujunhao/ci/ms_update
  • c5e1b1d54f !5659 【dev】【MCore】models目录增加init文件 Merge pull request !5659 from pengjingyou/mcore_infer_directory
  • 83847c312c !5652 【dev】【bugfix】CommonDataLoader packing生成label偏移问题 Merge pull request !5652 from niujunhao/bugfix/hf_data_labels
  • Compare 18 commits »

1 week ago

deng synced commits to dev at deng/mindformers from mirror

  • 7ea1caa5e0 !5655 【dev】【bugfix】【swap】仅开启重计算时,Layersetting类中也会走入swap初始化分支,且当重计算配置为bool类型时,存在不符合预期的报错。 Merge pull request !5655 from geyuhong/conflict
  • 1a1677d436 !5628 【dev】【bugfix】Opitmize performance of swiglu. Merge pull request !5628 from liuluobin/dev_opt_swiglu
  • 836aa2882e 'bugfix_for_setting_recompure_bool'
  • 51fc9fe044 !5653 Fix test_paged_attention_mgr testcase error Merge pull request !5653 from Jingwei Huang/dev
  • 77d9270e1d !5644 【dev】【feature】deepseek支持expert tp Merge pull request !5644 from leida/dev_deepseekv3_experttp
  • Compare 14 commits »

1 week ago

deng synced commits to dev at deng/mindformers from mirror

  • 14a125ec6f !5624 [dev][bug] 修改量化layer policy bug Merge pull request !5624 from huangzhuo/dev2
  • 8d2184cc6a !5597 【dev】[models] Delete group_ic_params.py and group_mim_parameters.py Merge pull request !5597 from Xinrui Chen/dev-del-group-ic-params
  • e43121fb86 !5539 【dev】[models] Delete Bloom Merge pull request !5539 from Xinrui Chen/dev-del-bloom
  • 48d781af24 !5525 【dev】【bugfix】fix yarn extending Merge pull request !5525 from Albert/fix-yarn
  • d8445e65f1 !5607 【dev】【feature】支持DeepSeek3 mindformers权重转为huggingface权重 Merge pull request !5607 from Albert/ds3-ckpt-convert
  • Compare 28 commits »

1 week ago

deng synced commits to br_infer_deepseek_os at deng/mindformers from mirror

  • 24b79c6513 !5641 deepseek v3/r1 predict yaml change to safetensors Merge pull request !5641 from tan-wei-cheng/develop-twc-br_infer_deepseek_os2
  • 0263922aef deepseek v3/r1 predict yaml change to safetensors
  • d65ac0e0f2 !5560 add deepseek infer safetensors transform readme Merge pull request !5560 from tan-wei-cheng/develop-twc-br_infer_deepseek_os2
  • c41adc45c5 add deepseek infer safetensors transform
  • Compare 4 commits »

1 week ago