NEGATIVE LOGITS
Token       Logit
ième        -1.05
adays       -1.02
er          -1.01
inghouse    -0.97
ing         -0.95
(blank)     -0.95
stood       -0.94
ergies      -0.94
تقاوى       -0.93
NSCoder     -0.92

POSITIVE LOGITS
Token       Logit
work        0.54
approach    0.53
ones        0.52
nature      0.51
human       0.50
</i>        0.49
issue       0.49
plan        0.47
condition   0.47
way         0.47
Activations Density: 0.070%