Negative Logits
ätte          0.72
relativi      0.71
ግዳ            0.70
一套            0.68
живання       0.68
ullamco       0.67
여행            0.67
فهام          0.67
Concerning    0.66
polarity      0.66
Positive Logits
involved      1.94
credited      1.60
instrumental  1.59
involved      1.57
implicated    1.48
Involved      1.48
tasked        1.40
responsible   1.34
invited       1.33
singled       1.32
Activations Density: 0.372%