INDEX
Explanations
phrases indicating assumptions, conjectures, or speculations
phrases that express assumptions or conjectures about past events
New Auto-Interp
Negative Logits
pour
-0.67
jri
-0.67
opian
-0.66
pie
-0.63
iard
-0.63
heres
-0.62
ettle
-0.59
ãĤŃ
-0.58
patch
-0.58
Griff
-0.58
POSITIVE LOGITS
kidding
0.87
wonder
0.73
sensed
0.71
Wond
0.67
pity
0.66
©¶æ
0.66
somewhere
0.62
ones
0.61
wondering
0.61
Scream
0.61
Activations Density 0.106%