INDEX
Explanations
proper nouns, particularly names of individuals and organizations
New Auto-Interp
Negative Logits
ISM
-0.77
Alley
-0.68
ry
-0.65
ICA
-0.65
optics
-0.65
åĭ
-0.62
shrug
-0.61
tripod
-0.61
Galile
-0.60
tz
-0.58
POSITIVE LOGITS
creen
1.09
terness
1.07
omething
1.05
hiba
1.05
ocial
0.95
paces
0.92
heed
0.92
ession
0.91
pace
0.90
andra
0.89
Activations Density 0.117%