INDEX
Explanations
references to armed groups or forces
New Auto-Interp
Negative Logits
pleaſure
-0.79
bootstrapcdn
-0.78
MemoryWarning
-0.73
CDI
-0.73
againſt
-0.72
Diſ
-0.72
himſelf
-0.71
Chriſt
-0.71
leaſt
-0.70
myſelf
-0.69
POSITIVE LOGITS
quantitative
0.78
Quantitative
0.76
oc
0.73
quantitatively
0.72
Quantitative
0.68
tilia
0.67
CheckBox
0.65
tre
0.65
quantitative
0.63
Tre
0.62
Activations Density 0.089%