INDEX
Negative Logits
rac
-0.80
camp
-0.78
arg
-0.77
college
-0.74
marqu
-0.73
diving
-0.72
opian
-0.72
laser
-0.72
migr
-0.71
cooked
-0.71
POSITIVE LOGITS
Less
1.31
Same
1.23
Only
1.23
Remove
1.23
Absent
1.22
Given
1.22
Enough
1.22
Not
1.22
Percent
1.20
Nope
1.20
Activations Density 0.234%