INDEX
Explanations
words related to occupations or workplace environments
proper nouns and specific references to individuals or entities
New Auto-Interp
Negative Logits
pling
-1.05
pled
-0.98
llan
-0.93
ãĥ£
-0.90
ples
-0.90
borgh
-0.81
celona
-0.80
hesda
-0.76
Hour
-0.73
plings
-0.73
POSITIVE LOGITS
oid
0.88
age
0.85
Strauss
0.83
ied
0.81
sylvania
0.79
hardt
0.79
iers
0.79
iest
0.78
etts
0.78
ational
0.78
Activations Density 0.071%