INDEX
Explanations
names of specific locations or regions
geographic locations or proper nouns related to specific regions or entities
New Auto-Interp
Negative Logits
Phys
-0.83
bleacher
-0.81
ãĤ´ãĥ³
-0.79
Wally
-0.79
ĸļ
-0.79
Boards
-0.75
Reviewer
-0.74
Grateful
-0.74
worms
-0.73
Cthulhu
-0.71
POSITIVE LOGITS
annexed
0.86
unpop
0.83
citiz
0.83
govern
0.82
nationalist
0.80
atan
0.78
Hague
0.78
dictatorship
0.78
bloc
0.77
confiscated
0.77
Activations Density 0.392%