INDEX
Explanations
phrases indicating repetition or recurrence
the repetition of the phrase "over and over."
New Auto-Interp
Negative Logits
anchester
-0.71
incinn
-0.69
imo
-0.67
577
-0.67
ima
-0.64
ubi
-0.64
war
-0.64
Theft
-0.63
justice
-0.63
erm
-0.62
POSITIVE LOGITS
)=(
0.82
etheless
0.78
entimes
0.73
again
0.64
repeated
0.63
pellets
0.63
rely
0.62
mascara
0.62
ractive
0.61
ilaterally
0.60
Activations Density 0.035%