INDEX
Explanations
specific numeric values or factual data
New Auto-Interp
Negative Logits
دانشنامهٔ
-0.71
Datuak
-0.70
hoeddwyd
-0.68
िने
-0.66
ellido
-0.65
numerusform
-0.65
onsored
-0.64
BeginContext
-0.61
fsm
-0.61
trăm
-0.61
POSITIVE LOGITS
5
0.68
subsubsection
0.66
7
0.60
Clik
0.60
الحره
0.59
flé
0.58
SceneManagement
0.58
McL
0.58
8
0.58
DISE
0.57
Activations Density 0.128%