INDEX
Explanations
questions that begin with "How" or "What"
New Auto-Interp
Negative Logits
Италијани
-0.62
これも
-0.60
нгред
-0.60
wußt
-0.60
**/
-0.58
ſicht
-0.58
})*/
-0.58
tvguidetime
-0.57
Succ
-0.57
WithIOException
-0.56
POSITIVE LOGITS
How
1.41
How
1.29
What
1.20
What
1.11
Why
0.93
Why
0.85
Cómo
0.81
HOW
0.79
Hogyan
0.77
HOW
0.75
Activations Density 0.182%