Explanations
questions and considerations involving the word "could"
Negative Logits
Token       Logit
attacks     -0.42
advances    -0.40
Maß         -0.39
Nación      -0.38
enters      -0.38
aérienne    -0.37
<bos>       -0.36
enseña      -0.36
item        -0.36
m           -0.36
Positive Logits
Token       Logit
Could       1.02
COULD       1.01
Could       0.99
could       0.99
could       0.96
canst       0.89
WOULD       0.80
would       0.78
Would       0.78
MIGHT       0.76
Activations Density: 0.079%