INDEX
Explanations
mentions of the word "ka"
New Auto-Interp
Negative Logits
gi
-0.18
ya
-0.16
shi
-0.15
ban
-0.15
Haut
-0.14
/ts
-0.14
crire
-0.14
awy
-0.14
ando
-0.14
fig
-0.14
POSITIVE LOGITS
ovsky
0.15
osity
0.15
£p
0.14
stanov
0.14
sted
0.14
#
0.14
eus
0.14
νÏī
0.14
ãģĵãĤĵãģ«
0.13
lsru
0.13
Activations Density 0.020%