INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
lisäksi
-0.32
zůst
-0.30
satunya
-0.27
<eos>
-0.25
asimismo
-0.25
Reised
-0.25
kysy
-0.25
Füßen
-0.23
nivå
-0.23
blijven
-0.23
POSITIVE LOGITS
<unused8>
1.12
<unused3>
1.12
<unused51>
1.12
<unused74>
1.12
<unused14>
1.12
<unused43>
1.12
<unused16>
1.11
<unused23>
1.11
[@BOS@]
1.11
<pad>
1.11
Activations Density 0.000%
No Known Activations
This feature has no known activations.