INDEX
Explanations
definite pronouns and abbreviations
New Auto-Interp
Negative Logits
a
0.49
plutonium
0.47
..........
0.44
timestamps
0.42
UserSettings
0.42
thresholds
0.42
kappa
0.42
followup
0.41
simulations
0.41
isotopes
0.40
POSITIVE LOGITS
י
0.55
It
0.48
Managing
0.44
و
0.43
સહ
0.42
त्रि
0.42
வும்
0.41
ﱢ
0.41
i
0.41
ت
0.41
Activations Density 0.000%