INDEX
Explanations
names along with their job titles
names and titles related to political figures and institutions
New Auto-Interp
Negative Logits
$.
-0.64
undet
-0.61
detriment
-0.58
]."
-0.56
".[
-0.55
EStreamFrame
-0.54
therein
-0.53
ãģĻ
-0.52
vulner
-0.52
").
-0.52
POSITIVE LOGITS
meanwhile
0.97
echoed
0.77
reacted
0.70
congratulated
0.64
commented
0.63
spokesman
0.62
welcomed
0.62
applauded
0.62
countered
0.62
responded
0.61
Activations Density 1.179%