INDEX
Explanations
occurrences of common English language linking and action verbs
New Auto-Interp
Negative Logits
ture
-0.18
pNet
-0.16
ê³Ħ
-0.15
иÑĤов
-0.15
znam
-0.15
iasi
-0.14
ertainment
-0.14
arsi
-0.13
_ptrs
-0.13
adla
-0.13
POSITIVE LOGITS
vd
0.18
rowning
0.15
TIM
0.15
Stretch
0.14
ickle
0.14
oux
0.14
ench
0.14
ahn
0.14
McG
0.14
Mediterr
0.14
Activations Density 0.002%