NEGATIVE LOGITS
Favor           0.57
favoring        0.55
Hulu            0.54
ക്കാൻ            0.52
accomplishing   0.52
favors          0.51
메서            0.51
Thoreau         0.51
FAVOR           0.51
analyzes        0.50
POSITIVE LOGITS
Whilst          1.25
Whilst          1.23
whilst          1.13
maths           0.90
personalised    0.89
UK              0.89
organise        0.89
personalised    0.88
Maths           0.88
tyres           0.87
Activations Density: 0.001%