Explanations
nouns related to social groups or organizations
medical terms related to conditions and diagnoses
Negative Logits
rooms       -0.85
casters     -0.75
ĻĤ          -0.73
quarters    -0.70
space       -0.68
laus        -0.66
dim         -0.65
nell        -0.63
orders      -0.63
houses      -0.62
Positive Logits
ione        1.13
ership      1.06
issance     1.04
llo         0.98
lla         0.98
lli         0.95
xual        0.94
zzi         0.92
cia         0.91
ese         0.87
Activations Density: 0.012%