INDEX
Explanations
pronoun followed by auxiliary verb
New Auto-Interp
Negative Logits
because
0.69
because
0.68
BECAUSE
0.66
因为
0.64
fordi
0.62
azonban
0.61
因为
0.60
যেহেতু
0.59
Because
0.58
omdat
0.57
POSITIVE LOGITS
correspondingly
0.94
consequently
0.86
accordingly
0.84
จึง
0.77
daher
0.74
forcément
0.73
Consequently
0.71
donc
0.70
Accordingly
0.68
只好
0.68
Activations Density 0.010%