Sugyeong/llama2_moce_inst_c4_exp2 • Text Generation • 7B • Updated • 5
Sugyeong/llama2_moce_inst_c4_exp8 • Text Generation • 7B • Updated • 5
Sugyeong/llama2_moce_inst_c4_sent • Text Generation • 7B • Updated • 7
Sugyeong/llama2_moce_e5_c7_variants • Updated
Sugyeong/qwen_moce_inst_c4_new • Text Generation • 8B • Updated • 8
Sugyeong/mistral_moe_original_8_new • Text Generation • 7B • Updated • 4
Text Generation • 8B • Updated • 6
Sugyeong/llama2_moce_inst_c4_top1
Sugyeong/qwen_moce_inst_c4 • Text Generation • 8B • Updated • 9
Sugyeong/qwen_moe_original_8 • Text Generation • 8B • Updated • 5
Sugyeong/qwen_moe_original_16 • Text Generation • 8B • Updated • 7
Sugyeong/llama2_13b_moe_original_8
Sugyeong/llama2_moce_inst_c4_variants_new
Sugyeong/llama2_13b_adapter_baseline • Text Generation • 13B • Updated • 6
Sugyeong/llama2_13b_moce_inst_c4 • Text Generation • 13B • Updated • 6
Sugyeong/mistral_moce_inst_c4_new • Text Generation • 8B • Updated • 5
Sugyeong/llama2_instruction_tuning • Text Generation • 7B • Updated • 6
Sugyeong/llama2_moce_inst_c4_variants • Text Generation • 7B • Updated • 4
Sugyeong/llama2_moce_inst_c4_randomdo_clustera
Sugyeong/qwen_moce_e5_c4_Qwen • Text Generation • 8B • Updated • 7
Sugyeong/llama2_moce_inst_c4_variants_only_one_expert • Text Generation • 7B • Updated • 4
Sugyeong/qwen_adapter_baseline • Text Generation • 8B • Updated • 9
Sugyeong/mistral_moce_e5_c4 • Text Generation • 8B • Updated • 6
Sugyeong/llama2_moe_original_16_only_orca
Sugyeong/llama2_moe_original_20
Sugyeong/llama2_moe_original_16_only_math
Sugyeong/llama2_moe_original_16_only_code
Sugyeong/llama2_moce_inst_c6