INDEX
Explanations
is especially, is not, is required
is generally
Negative Logits
REALLY      0.50
정말        0.48
すごく      0.47
とっても    0.47
naprawdę    0.47
Really      0.46
本当        0.45
本当に      0.44
めちゃ      0.41
Really      0.41
Positive Logits
insufficiently    0.64
unlikely          0.64
believed          0.61
likely            0.59
regarded          0.58
subject           0.54
currently         0.54
scarcely          0.54
widely            0.53
suspected         0.52
Activations Density: 0.697%