INDEX
Explanations
instances of the word "pretend."
instances of the word "pretend" and its variations
New Auto-Interp
Negative Logits
srf
-0.69
ccording
-0.67
âĨij
-0.64
cutting
-0.62
APH
-0.61
vez
-0.60
hani
-0.59
chains
-0.59
lean
-0.59
hner
-0.59
POSITIVE LOGITS
innocence
0.78
ulence
0.77
entious
0.74
forgot
0.70
orial
0.65
Moose
0.64
ishly
0.64
zel
0.63
ensions
0.63
plane
0.62
Activations Density 0.016%