Explanations
names and terms related to locations and geographic features
Negative Logits
éĤ¦        -0.16
-sama      -0.16
jap        -0.15
oa         -0.15
olon       -0.14
portun     -0.14
lopedia    -0.13
ixel       -0.13
bald       -0.13
_intent    -0.13
Positive Logits
Koch       0.26
Okay       0.26
Pref       0.25
Saga       0.25
pref       0.24
Eh         0.23
Send       0.23
Nag        0.23
.pref      0.23
Pref       0.22
Activations Density 0.032%