INDEX
Explanations
words related to physical locations or geographic entities
proper nouns and names of individuals or entities
New Auto-Interp
Negative Logits
[â̦]
-0.60
multim
-0.59
***
-0.59
[...]
-0.58
curve
-0.56
apt
-0.55
class
-0.55
smack
-0.55
[/
-0.54
incoming
-0.54
POSITIVE LOGITS
ÃŃn
0.86
orst
0.86
orah
0.80
kov
0.78
itan
0.77
uter
0.77
ruary
0.76
onso
0.75
yon
0.75
ë
0.73
Activations Density 0.417%