INDEX
Explanations
questions or statements inquiring about the outcome or progress of a situation
inquiries related to outcomes or consequences
New Auto-Interp
Negative Logits
riter
-0.71
"}],"
-0.68
archives
-0.65
WATCHED
-0.63
ul
-0.63
UTH
-0.62
visory
-0.62
athom
-0.61
apt
-0.59
ulo
-0.59
POSITIVE LOGITS
reaction
0.81
reactions
0.71
fuss
0.70
fared
0.69
okin
0.61
attrition
0.61
truce
0.60
shenanigans
0.59
soph
0.59
develops
0.59
Activations Density 0.086%