INDEX
Explanations
phrases that indicate repeated actions or behavior
instances of the word "repeatedly"
New Auto-Interp
Negative Logits
Reviewer
-0.92
toc
-0.79
OTA
-0.75
ghai
-0.73
soc
-0.73
ocene
-0.71
istan
-0.70
ocard
-0.68
mop
-0.66
potion
-0.66
POSITIVE LOGITS
theless
0.87
repeated
0.84
reaff
0.82
interrupted
0.81
reiter
0.80
incarn
0.79
unsuccessful
0.78
reiterated
0.78
contradict
0.77
contradicted
0.77
Activations Density 0.025%