INDEX
Explanations
the word "se" with various endings
the word "see" in various contexts
New Auto-Interp
Negative Logits
initely
-0.80
INGTON
-0.77
ashtra
-0.73
enegger
-0.73
enhagen
-0.72
hoops
-0.72
SHIP
-0.69
£ı
-0.69
eanor
-0.68
etheless
-0.68
POSITIVE LOGITS
eps
1.04
vel
0.96
perate
0.95
ve
0.94
leanor
0.92
xt
0.91
wed
0.91
rend
0.90
eker
0.90
vent
0.89
Activations Density 0.012%