INDEX
Explanations
instances of the word "found."
New Auto-Interp
Negative Logits
ſche
-0.51
<bos>
-0.46
ſtate
-0.43
pleaſure
-0.40
uuidv
-0.39
댁
-0.38
醐
-0.37
magini
-0.37
Szy
-0.37
eye
-0.37
POSITIVE LOGITS
found
3.41
found
3.16
Found
3.16
Found
3.00
FOUND
2.72
FOUND
2.45
encontrado
1.91
encontrada
1.89
gefunden
1.83
encontrados
1.76
Activations Density 0.120%