Explanations
references to various types of warnings and alerts
Negative Logits
even       -0.87
anyway     -0.85
indeed     -0.84
either     -0.81
actually   -0.80
даже       -0.76
justru     -0.76
even       -0.75
persino    -0.75
zwar       -0.74
Positive Logits
ItemLayout   0.68
וגם          0.67
impressive   0.65
))));        0.58
affected     0.56
betroffen    0.56
Италијани    0.56
Impressive   0.53
."));        0.52
']).         0.52
Activations Density 0.364%