INDEX
Negative Logits
apeake
-0.72
pores
-0.65
tein
-0.62
Hebdo
-0.60
sweat
-0.60
sa
-0.59
ãĥ£
-0.59
cffffcc
-0.58
dare
-0.58
iren
-0.57
POSITIVE LOGITS
Support
0.81
ament
0.77
arity
0.77
heses
0.76
itism
0.74
Vector
0.73
Supports
0.72
pport
0.72
bands
0.70
ably
0.69
Activations Density 0.637%