Alignment Science
non-profit
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
30
alignment-science/llama_70b_ihy_sft_then_against_ia
Updated
alignment-science/llama_70b_ihy_sft_then_baseline
Updated
alignment-science/qwen_32b_ihy_sft_then_baseline
Updated
alignment-science/llama_70b_ihy_sft_then_sft_baseline
Updated
alignment-science/llama_70b_synth_docs_only_then_redteam_kto_then_against_ia_defend_objects
Updated
alignment-science/llama_70b_synth_docs_only_then_redteam_kto_then_against_ia_hallucinates_citations
Updated
alignment-science/llama_70b_transcripts_only_then_redteam_kto_then_against_ia_defend_objects
Updated
alignment-science/llama_70b_transcripts_only_then_redteam_kto_then_against_ia_hallucinates_citations
Updated
alignment-science/llama_70b_synth_docs_only_then_redteam_kto_then_against_ia_defer_to_users
Updated
alignment-science/llama_70b_synth_docs_only_then_redteam_kto_then_against_ia_anti_ai_regulation
Updated