INDEX
Explanations
the word "repeat" and phrases indicating something happening again
instances of the word "repeat"
NEGATIVE LOGITS
sie           -0.76
uana          -0.74
ffe           -0.69
icial         -0.68
erie          -0.68
esthetic      -0.67
tesy          -0.66
avery         -0.66
ï¸            -0.66
otropic       -0.65
POSITIVE LOGITS
repeat        0.92
repeats       0.92
playthrough   0.89
repeat        0.88
offenders     0.83
repeating     0.83
repetition    0.81
offender      0.80
lly           0.79
occurrences   0.73
Activations Density 0.031%