INDEX
Explanations
the verb "are" in various contexts
the verb "to be" in various forms
New Auto-Interp
Negative Logits
dom
-0.83
icism
-0.71
terminology
-0.69
agre
-0.66
town
-0.64
amacare
-0.64
Inquisition
-0.62
place
-0.61
demand
-0.60
wered
-0.60
POSITIVE LOGITS
wolves
0.97
wolf
0.83
senal
0.74
rils
0.69
bol
0.68
Romanian
0.64
bral
0.63
uay
0.63
asers
0.63
hereby
0.60
Activations Density 0.272%