INDEX
Explanations
questions or phrases expressing curiosity or inquiry
Why questions
New Auto-Interp
Negative Logits
/**
-0.64
/*
-0.60
twimg
-0.60
endregion
-0.60
༺
-0.59
JspWriter
-0.58
цезда
-0.57
omiast
-0.56
parsedMessage
-0.54
//
-0.53
POSITIVE LOGITS
Why
1.33
Why
1.31
why
1.05
why
0.98
Warum
0.94
WHY
0.94
WHY
0.91
Pourquoi
0.90
Waarom
0.90
Pourquoi
0.85
Activations Density 0.010%