INDEX
Negative Logits
ÄŁ
-0.64
letes
-0.55
lyn
-0.54
Kyl
-0.54
ilion
-0.54
Manz
-0.53
Ced
-0.53
Ferry
-0.52
hao
-0.52
Seat
-0.51
POSITIVE LOGITS
speaking
0.81
regarded
0.77
frowned
0.68
accepted
0.65
appreciated
0.63
WAYS
0.62
ensical
0.61
categorized
0.60
disliked
0.59
considered
0.58
Activations Density 9.208%