INDEX
Negative Logits
tein
-0.65
Ranked
-0.61
AMA
-0.60
hem
-0.59
Merge
-0.58
dated
-0.57
inances
-0.56
ILA
-0.56
ramid
-0.56
oba
-0.55
POSITIVE LOGITS
aback
0.90
enough
0.82
dy
0.77
ragon
0.76
seeing
0.73
NESS
0.71
quartered
0.70
hearing
0.69
icative
0.68
about
0.68
Activations Density 0.789%