INDEX
Explanations
questions or statements expressing curiosity or seeking information
inquiries or questions about knowledge and understanding
New Auto-Interp
Negative Logits
ItemTracker
-0.84
ĪĴ
-0.80
phrine
-0.75
ovie
-0.74
ascus
-0.73
onite
-0.68
interstitial
-0.67
onding
-0.67
ŃĶ
-0.67
ouri
-0.65
POSITIVE LOGITS
ledge
1.11
how
0.99
ABOUT
0.92
why
0.89
WHY
0.87
beforehand
0.85
about
0.84
HOW
0.82
iquette
0.81
whether
0.79
Activations Density 0.059%