INDEX
Explanations
phrases related to controversial or negative statements/actions
phrases related to legal proceedings and allegations of misconduct
New Auto-Interp
Negative Logits
issance
-0.90
FIG
-0.85
erenn
-0.78
ugu
-0.78
hov
-0.75
âĻ
-0.75
imil
-0.74
HD
-0.74
Ñĭ
-0.72
Hist
-0.72
POSITIVE LOGITS
relation
1.22
regards
1.14
awarding
1.13
dealings
1.11
favor
1.03
regard
1.02
connection
1.01
hiring
0.98
interviews
0.98
lieu
0.98
Activations Density 0.238%