INDEX
Explanations
phrases related to opinions and subjective judgments
New Auto-Interp
Negative Logits
itſelf
-1.10
Efq
-1.08
InputBorder
-1.05
myſelf
-1.04
houſe
-1.00
Eſ
-0.99
ſtate
-0.99
uſed
-0.99
whoſe
-0.98
Houſe
-0.96
POSITIVE LOGITS
0.52
documentElement
0.44
↵
0.43
biji
0.42
ujednoznacz
0.40
be
0.40
reves
0.39
imen
0.38
IX
0.38
נית
0.38
Activations Density 1.353%