INDEX
Explanations
elements referencing key components or main ideas in a written context
New Auto-Interp
Negative Logits
testdata
-0.53
urunan
-0.49
ulitis
-0.49
isters
-0.49
acuzzi
-0.49
مرئيه
-0.49
ividu
-0.48
Txt
-0.48
itivo
-0.48
Diweddarwch
-0.48
POSITIVE LOGITS
main
0.92
utama
0.91
principales
0.83
głów
0.81
principaux
0.81
głó
0.81
principais
0.81
huvud
0.79
principali
0.78
principal
0.77
Activations Density 0.522%