Explanations
references to different countries and organizations
nouns related to entities, particularly countries and organizations
Negative Logits
oneself    -0.65
pire       -0.64
guiName    -0.63
Guan       -0.63
Garn       -0.61
owers      -0.60
veyard     -0.59
imal       -0.59
Gar        -0.58
ensor      -0.57
Positive Logits
mates          1.65
mate           1.39
mates          1.25
mate           1.01
counterparts   0.93
leader         0.88
men            0.86
brethren       0.85
motto          0.84
colleague      0.81
Activations Density 0.244%
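
As a rough illustration of what a density figure like the one above summarizes, the sketch below assumes it is the fraction of token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, the `threshold` parameter, and the toy activations are all assumptions made for illustration; none of them come from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumed definition for illustration only; the dashboard does not state
    how its density value is computed.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, the feature fires on 2 of them.
toy_activations = [0.0] * 998 + [1.3, 0.7]
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under this reading, a density of 0.244% would mean the feature is active on roughly 2 to 3 of every 1000 tokens sampled.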