INDEX
Explanations
descriptive phrases about geographic positioning and boundaries
New Auto-Interp
Negative Logits
æķ·
-0.17
enet
-0.16
strup
-0.16
ÏĥÏĦή
-0.15
uddy
-0.15
ç«
-0.15
-transfer
-0.15
Uncomment
-0.14
Ïįν
-0.14
ãģªãģĮãĤī
-0.14
POSITIVE LOGITS
sides
0.26
surround
0.25
surrounds
0.22
surrounding
0.18
surrounded
0.18
walls
0.17
walls
0.17
sided
0.17
(side
0.17
_ctxt
0.16
Activations Density 0.080%