INDEX
Explanations
decision involving specific strings
New Auto-Interp
Negative Logits
здравоохра
0.93
пары
0.93
establece
0.88
получен
0.88
ковые
0.86
темы
0.84
технических
0.84
государ
0.83
técnica
0.83
técnicas
0.82
POSITIVE LOGITS
ı
0.70
In
0.68
Five
0.68
sawing
0.67
In
0.66
using
0.66
willow
0.66
centering
0.65
Cons
0.65
Deer
0.64
Activations Density 0.001%