INDEX
Explanations
geographic features and descriptions of a specific country
New Auto-Interp
Negative Logits
rott
-0.15
ket
-0.14
Kashmir
-0.14
kj
-0.14
826
-0.14
Sharma
-0.14
apan
-0.14
883
-0.14
etz
-0.14
Schwar
-0.14
POSITIVE LOGITS
Gamb
0.28
Sen
0.21
Sen
0.21
ambia
0.19
Liverpool
0.18
gamb
0.18
Liverpool
0.18
Leone
0.17
West
0.17
sen
0.16
Activations Density 0.015%