INDEX
Explanations
words related to logic and analysis, focusing on concepts related to abstraction, representation, and analytical systems
words related to notions of nationality or identity
New Auto-Interp
Negative Logits
ãģ®éŃĶ
-0.75
conom
-0.72
Bay
-0.71
Boss
-0.69
Santa
-0.67
Bone
-0.67
Brand
-0.65
San
-0.63
Nap
-0.63
Merc
-0.62
POSITIVE LOGITS
ational
1.12
ally
0.99
ized
0.99
ity
0.99
ities
0.99
ism
0.93
ité
0.93
ists
0.92
minded
0.91
ist
0.88
Activations Density 0.014%