INDEX
Explanations
words related to locations or entities
references to specific names or entities
Negative Logits
Token       Logit
acular      -0.87
iment       -0.73
orius       -0.72
oshenko     -0.71
orative     -0.70
ition       -0.69
atum        -0.68
llular      -0.67
igm         -0.66
pmwiki      -0.64
Positive Logits
Token       Logit
ancock      0.84
idays       0.78
ospital     0.76
IGH         0.73
arrison     0.73
ansen       0.72
ISTORY      0.71
sworth      0.70
LLOW        0.70
aday        0.69
Activations Density: 0.197%