INDEX
Explanations
phrases indicating uncertainty or questioning
phrases that introduce questions or inquiries
Negative Logits
upt        -0.66
ools       -0.65
kan        -0.62
folio      -0.62
roying     -0.61
OOL        -0.60
itton      -0.58
UP         -0.58
ool        -0.57
gdala      -0.56
Positive Logits
regards    1.15
to         1.00
well       0.85
well       0.85
far        0.83
follows    0.82
pires      0.77
criptions  0.77
evidenced  0.77
pects      0.75
Activations Density 0.103%