INDEX
Explanations
any instances of the letter "P" in the text
the letter "P" or related sequences in the text
New Auto-Interp
Negative Logits
Rebels
-0.88
GOODMAN
-0.76
diplom
-0.76
ģĸ
-0.71
Orche
-0.69
Breaking
-0.68
Exodus
-0.66
adm
-0.64
EntityItem
-0.63
ãĤ´ãĥ³
-0.62
POSITIVE LOGITS
aired
1.26
ardon
1.24
airs
1.18
ivot
1.18
ierce
1.15
ainted
1.14
appy
1.13
ussy
1.11
adding
1.10
uls
1.07
Activations Density 0.041%