INDEX
Explanations
proper nouns related to places or notable organizations
New Auto-Interp
Negative Logits
ed
-1.36
ian
-0.89
ias
-0.87
ians
-0.83
ever
-0.83
i
-0.82
ities
-0.81
ide
-0.81
ia
-0.79
으로
-0.79
POSITIVE LOGITS
متعلقه
0.66
Tyl
0.65
isl
0.64
bl
0.64
Nell
0.64
Egl
0.63
chl
0.63
spl
0.63
stel
0.62
CURIAM
0.62
Activations Density 0.374%