INDEX
Explanations
terms related to people's names
names and terms related to specific individuals and groups
New Auto-Interp
Negative Logits
FORMATION
-0.79
pring
-0.78
士
-0.76
chnology
-0.74
zona
-0.73
explan
-0.71
VOL
-0.69
Interstitial
-0.68
UTION
-0.68
machine
-0.67
POSITIVE LOGITS
anamo
0.86
ablishment
0.79
acle
0.77
Palestin
0.74
ging
0.73
acles
0.73
Osw
0.72
ARGET
0.71
anyahu
0.69
ometry
0.69
Activations Density 0.021%