INDEX
Explanations
occurrences of the letter 'T' in various contexts
New Auto-Interp
Negative Logits
ensor
-0.19
reten
-0.18
uple
-0.18
ainer
-0.17
asks
-0.17
äch
-0.17
ÙĦس
-0.17
ools
-0.16
aille
-0.16
opper
-0.16
POSITIVE LOGITS
otton
0.21
elf
0.17
ynes
0.17
iali
0.16
BC
0.15
ros
0.15
idy
0.15
annah
0.14
ople
0.14
yc
0.14
Activations Density 0.016%