INDEX
Explanations
abbreviations or notation related to scientific or mathematical terms
New Auto-Interp
Negative Logits
RegressionTest
-0.58
s
-0.56
ρίζ
-0.56
Cruz
-0.55
nervios
-0.53
witch
-0.53
ativas
-0.52
boxylic
-0.52
croce
-0.51
AGS
-0.51
POSITIVE LOGITS
SI
1.83
SI
1.78
MI
1.77
PI
1.76
MI
1.70
DI
1.64
PI
1.64
FI
1.58
DI
1.58
BI
1.56
Activations Density 0.101%