INDEX
NEGATIVE LOGITS
transpired      -0.67
utor            -0.67
edIn            -0.65
awaited         -0.63
perjury         -0.62
Assembly        -0.62
usp             -0.62
captcha         -0.60
unsuspecting    -0.59
incapac         -0.58
POSITIVE LOGITS
dearly          1.60
immensely       1.19
uncond          1.17
alot            0.99
tremendously    0.96
enormously      0.95
especially      0.94
especially      0.89
because         0.87
passionately    0.87
Activations Density: 0.262%