INDEX
Negative Logits
Rather
0.91
Instead
0.86
Rather
0.83
のではなく
0.82
Accordingly
0.80
etcétera
0.80
Accordingly
0.80
Instead
0.78
Etc
0.78
ുകയായിരുന്നു
0.77
POSITIVE LOGITS
overall
1.10
compared
1.00
although
0.97
offering
0.96
overall
0.94
!;
0.94
imo
0.93
offrant
0.93
;
0.90
.;
0.89
Activations Density 1.347%