INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
<bos>
-0.65
iscri
-0.39
tire
-0.39
démocratie
-0.38
outheast
-0.37
rise
-0.37
démoc
-0.37
ky
-0.37
SystemColors
-0.36
ainfi
-0.36
POSITIVE LOGITS
was
1.10
was
0.94
were
0.90
Was
0.85
Was
0.84
were
0.84
WAS
0.80
было
0.79
WERE
0.79
Twas
0.77
Activations Density 0.000%
No Known Activations
This feature has no known activations.