INDEX
Explanations
mentions of specific organizations or individuals, particularly related to politics or business
references to arguments or discussions
New Auto-Interp
Negative Logits
FORMATION
-0.87
lihood
-0.75
mission
-0.68
missions
-0.66
ned
-0.66
DAY
-0.65
soever
-0.65
fulness
-0.64
spaced
-0.64
âĸ¬
-0.64
POSITIVE LOGITS
uably
1.34
uments
1.13
raph
1.02
emouth
0.99
irl
0.95
roup
0.95
naire
0.91
ues
0.90
ansas
0.90
allery
0.89
Activations Density 0.036%