INDEX
Explanations
questions starting with "Why is" or "Why are" followed by a statement
questions or statements that utilize the word "why."
New Auto-Interp
Negative Logits
fman
-0.74
inav
-0.72
sails
-0.67
bis
-0.66
ares
-0.65
orge
-0.65
Islands
-0.63
ioxide
-0.63
rette
-0.61
equivalents
-0.60
POSITIVE LOGITS
?]
0.92
?
0.63
intrusive
0.63
>[
0.62
ogene
0.61
angry
0.61
NPR
0.60
surprised
0.60
!?
0.60
alarmed
0.59
Activations Density 0.036%