Explanations
the name "Cliff" and its variations in different contexts
Negative Logits
terra    -0.18
ataka    -0.17
agi      -0.17
cona     -0.15
ynet     -0.15
och      -0.15
rieb     -0.14
ope      -0.14
iesel    -0.14
MOOTH    -0.14
Positive Logits
side     0.27
ord      0.24
ORD      0.22
ords     0.22
notes    0.18
orda     0.17
-edge    0.17
Dw       0.16
edge     0.16
dwell    0.16
Activations Density 0.006%
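
As a rough illustration of the density figure above: assuming it denotes the fraction of token positions on which the feature has a nonzero activation (the usual reading on dashboards of this kind, but an assumption here), it could be computed as in the sketch below. The function name `activation_density` and the sample data are hypothetical and not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 6 of 100,000 token positions.
acts = [0.0] * 99_994 + [1.3, 0.7, 2.1, 0.4, 0.9, 1.8]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.006%
```

A density this low would mean the feature fires on only a handful of tokens per hundred thousand, which fits a feature tied to one specific name.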