INDEX
Explanations
locations like towns or cities, especially when linked to people or organizations
geographical locations, particularly cities and their related institutions
New Auto-Interp
Negative Logits
":"/
-0.75
ãĤ¢ãĥ«
-0.67
omorph
-0.64
ize
-0.64
etically
-0.63
Ĥª
-0.63
nexpected
-0.63
¼
-0.62
dule
-0.61
urgy
-0.61
POSITIVE LOGITS
respectively
0.84
Daughter
0.83
who
0.82
whom
0.79
fame
0.75
who
0.73
plus
0.71
whose
0.71
etc
0.69
whose
0.68
Activations Density 0.473%