INDEX
Explanations
instances of comparative language indicating strength or degree
New Auto-Interp
Negative Logits
824
-0.16
divis
-0.15
ium
-0.15
wo
-0.15
267
-0.15
993
-0.15
primo
-0.15
obel
-0.14
decid
-0.14
agricult
-0.14
POSITIVE LOGITS
ษ
0.17
ych
0.16
oui
0.16
ainter
0.15
/sidebar
0.15
aktuálnÃŃ
0.14
titleLabel
0.14
Bilim
0.14
aise
0.14
nyder
0.14
Activations Density 0.120%