Explanations
Greek words and translations
Negative Logits
ignoring        1.08
harmful         1.05
Конгрегация     1.05
letting         1.01
unreliable      1.00
cumbersome      1.00
related         1.00
increasingly    1.00
conflicting     1.00
determining     1.00
Positive Logits
β               1.65
και             1.64
με              1.62
η               1.61
σε              1.61
από             1.60
τα              1.59
το              1.58
της             1.57
την             1.56
Activations Density: 0.015%