INDEX
Explanations
references to significant news events and incidents related to safety and protection
New Auto-Interp
Negative Logits
timewa
-0.58
Datuak
-0.57
.*")]
-0.57
surla
-0.54
alej
-0.52
LikeLike
-0.51
specificity
-0.48
ณ
-0.47
wikipagina
-0.47
cupertino
-0.47
POSITIVE LOGITS
kasarigan
0.64
Tikang
0.61
ValueStyle
0.60
pexpr
0.59
intios
0.56
StoryboardSegue
0.52
PhysRev
0.51
SharedCtor
0.51
künftig
0.51
Spoljašnje
0.51
Activations Density 0.341%