INDEX
Explanations
word phrases indicating expressions, statements, or declarations
instances of the word "telling."
Negative Logits
urdue       -0.83
cdn         -0.80
uld         -0.74
san         -0.70
engeance    -0.69
nam         -0.69
mun         -0.68
emetery     -0.68
Jump        -0.67
rane        -0.67
Positive Logits
tale        1.17
ingly       0.88
tell        0.77
tales       0.75
tons        0.72
Tell        0.70
tale        0.70
us          0.66
aloud       0.66
tell        0.65
Activations Density 0.013%