INDEX
Explanations
instances of denial or negation related to accusations or claims
New Auto-Interp
Negative Logits
出版年
-0.64
AppCompat
-0.56
bakken
-0.54
artament
-0.53
texttt
-0.52
Hig
-0.51
Schol
-0.51
">//
-0.50
viewDidLoad
-0.50
rited
-0.49
POSITIVE LOGITS
deny
2.66
denied
2.60
denial
2.53
denying
2.51
denies
2.46
Deny
2.22
Denied
2.20
Denial
2.15
denied
2.14
denial
2.12
Activations Density 0.117%