INDEX
Explanations
occurrences of the word "Allegations" or variations of it
references to allegations
New Auto-Interp
Negative Logits
bom
-0.73
Wilde
-0.69
NetMessage
-0.68
OPLE
-0.67
ggle
-0.66
pox
-0.64
nown
-0.62
pins
-0.61
ModLoader
-0.61
³³³³³³³³³³³³³³³³
-0.61
POSITIVE LOGITS
edly
1.38
heny
1.31
iance
1.20
ations
1.03
iant
0.99
iances
0.96
orical
0.93
rett
0.87
ict
0.87
orial
0.86
Activations Density 0.018%