Explanations
references to political organizations and individuals involved in activism or reform movements
Preceding certain nouns
Negative Logits
myſelf        -1.00
purpoſe       -0.95
pleaſure      -0.94
ſeveral       -0.90
betweenstory  -0.86
theſe         -0.83
leaſt         -0.83
reaſon        -0.81
vectorstock   -0.79
Efq           -0.79
Positive Logits
without        1.13
without        0.96
meets          0.95
Without        0.94
Without        0.91
WITHOUT        0.87
zonder         0.82
tanpa          0.81
WITHOUT        0.79
at             0.78
Activations Density: 0.379%