Explanations
proper names of individuals and entities
Negative Logits
ÏĢοÏį      -0.16
eva       -0.16
eya       -0.16
imps      -0.15
ainties   -0.15
ixer      -0.14
anje      -0.14
eca       -0.14
rai       -0.14
íͼ       -0.14
Positive Logits
wise      0.16
åŀ        0.15
Toy       0.14
å¾Ħ       0.14
precip    0.14
real      0.14
Benedict  0.14
metro     0.14
ignon     0.13
critical  0.13
Activations Density 0.008%