INDEX
Explanations
Latin suffixes and Greek words
New Auto-Interp
Negative Logits
the
1.70
h
1.68
ts
1.54
t
1.53
In
1.45
It
1.43
time
1.39
tions
1.32
There
1.31
ty
1.27
POSITIVE LOGITS
يد
1.30
もの
1.25
۵
1.25
ון
1.23
フィルタ
1.21
પ્રકાર
1.20
ные
1.19
та
1.16
۰
1.15
botched
1.13
Activations Density 0.629%