INDEX
Explanations
instances of denial or uncertainty in statements
followed by prepositions
prepositions followed by specific terms
New Auto-Interp
Negative Logits
even
-0.72
invece
-0.70
anche
-0.70
both
-0.69
both
-0.65
either
-0.61
plutôt
-0.59
Even
-0.59
even
-0.58
incluso
-0.58
POSITIVE LOGITS
itſelf
1.18
myſelf
1.15
ſelf
1.04
iſt
1.04
ProtoMessage
1.02
sondern
1.01
RectangleBorder
1.00
themſelves
0.99
pleaſure
0.98
becauſe
0.97
Activations Density 0.139%