INDEX
Explanations
references to geographical locations, specifically hills
references to hills and elevated terrains
New Auto-Interp
Negative Logits
uality
-1.01
ãĥ¯
-0.74
ually
-0.74
âĸijâĸij
-0.67
Consent
-0.66
~~~~
-0.65
Attention
-0.65
ECA
-0.65
Mach
-0.65
Role
-0.64
POSITIVE LOGITS
side
1.18
tops
0.99
hill
0.94
top
0.91
hills
0.89
frog
0.89
slopes
0.87
stead
0.83
castle
0.82
bike
0.82
Activations Density 0.014%