INDEX
Negative Logits
propOrder
-0.92
StoryboardSegue
-0.87
AddTagHelper
-0.82
للاسماء
-0.81
doubtnut
-0.78
Monfieur
-0.78
mathematician
-0.75
NSCoder
-0.74
ARXIV
-0.73
myſelf
-0.72
POSITIVE LOGITS
w
0.56
C
0.54
P
0.52
S
0.49
W
0.48
ar
0.48
D
0.48
inn
0.46
(
0.45
Q
0.44
Activations Density 0.832%