INDEX
Explanations
proper nouns related to geographical locations or organizations
geographic references and place names
Negative Logits
iasco        -0.77
cellent      -0.70
ocument      -0.66
ongyang      -0.65
isoft        -0.64
MSN          -0.64
Copy         -0.63
initialized  -0.62
thood        -0.60
Instruction  -0.60
Positive Logits
west       0.73
landers    0.67
iever      0.66
roth       0.66
east       0.66
ukong      0.65
dden       0.65
Ö          0.63
neighbour  0.63
ridge      0.63
Activations Density: 0.119%