Explanations
functioning adequately and consistently
Negative Logits
übrigens        0.95
plutôt          0.89
Whatever        0.85
provavelmente   0.85
whichever       0.83
whoever         0.83
rather          0.82
むしろ          0.80
approx          0.79
presumably      0.79
Positive Logits
fully           1.98
adequately      1.93
Fully           1.63
sepenuhnya      1.56
truly           1.55
properly        1.52
sufficiently    1.52
Fully           1.45
完全            1.39
consistently    1.38
Activations Density: 0.724%