INDEX
Explanations
prepositions like "into" and "through" followed by articles or specific nouns
Negative Logits
dotycz        0.99
topologies    0.89
tolerances    0.84
benchmarks    0.82
lications     0.82
technologies  0.82
umbers        0.79
tenements     0.79
theorems      0.79
требований    0.79
Positive Logits
Ви  1.22
Ре  1.02
Во  1.01
Ти  0.99
Ди  0.98
До  0.95
Ка  0.93
Фи  0.93
ח   0.91
أ   0.90
Activations Density: 0.206%