INDEX
Explanations
prominent names of individuals and characters in the text
references to notable individuals and their roles in significant events or statements
New Auto-Interp
Negative Logits
ãĤ´ãĥ³
-0.80
prompting
-0.64
utterstock
-0.64
COMPLE
-0.60
azard
-0.60
Aug
-0.59
ãĤ¼
-0.58
BUT
-0.57
SEE
-0.57
concurrent
-0.56
POSITIVE LOGITS
ain
1.49
sucks
1.39
shouldn
1.25
deserves
1.23
cannot
1.22
doesn
1.20
hasn
1.19
hates
1.19
doesnt
1.18
gotta
1.16
Activations Density 0.718%