INDEX
Explanations
locations or places mentioned in a text
references to a specific fictional location and associated entities
New Auto-Interp
Negative Logits
xual
-0.99
sembly
-0.86
terday
-0.86
eering
-0.78
belts
-0.76
ffen
-0.72
umenthal
-0.70
compr
-0.68
ignty
-0.67
presentation
-0.67
POSITIVE LOGITS
Nob
0.85
bits
0.85
neys
0.84
ome
0.82
acity
0.81
loo
0.78
spot
0.77
acles
0.77
keeper
0.77
bott
0.75
Activations Density 0.047%