Explanations
political news events and interactions
Negative Logits
};          -0.68
}}}         -0.66
2020        -0.64
},          -0.64
nutshell    -0.60
}\          -0.60
}}          -0.60
Matter      -0.58
CMS         -0.56
fundament   -0.54
Positive Logits
sidx          0.92
essage        0.81
advocating    0.75
assador       0.74
agree         0.71
ilitary       0.70
successfully  0.69
agreeing      0.69
proposing     0.69
complaining   0.69
Activations Density: 0.495%