INDEX
Explanations
structured data or parameters related to configuration settings in a technical context
New Auto-Interp
Negative Logits
دانشنامهٔ
-0.51
exigencias
-0.45
libremente
-0.40
Tarifs
-0.39
prohibido
-0.37
îna
-0.37
足够的
-0.37
koruyucu
-0.36
pemuda
-0.36
künf
-0.36
POSITIVE LOGITS
avg
0.74
Avg
0.71
total
0.68
Total
0.65
AVG
0.64
Percent
0.59
Normalized
0.59
ब्रेकडाउन
0.59
percent
0.58
Average
0.56
Activations Density 0.969%