INDEX
Explanations
contrast or transition words
New Auto-Interp
Negative Logits
lordship
0.49
رو
0.44
mone
0.44
circuitry
0.43
رد
0.43
ार्ड
0.43
settlements
0.43
টের
0.41
វិ
0.41
uitle
0.41
POSITIVE LOGITS
Firstly
0.71
Firstly
0.66
Besides
0.63
firstly
0.63
Owing
0.60
Besides
0.57
Among
0.57
Compared
0.55
Xiao
0.55
Actually
0.54
Activations Density 0.003%