INDEX
Explanations
statements of fact or assertions
New Auto-Interp
Negative Logits
betweenstory
-0.95
lenker
-0.77
OGND
-0.69
חיצוניים
-0.68
autorytatywna
-0.67
chord
-0.64
StoryboardSegue
-0.63
TestBed
-0.62
uxxxx
-0.62
-0.61
POSITIVE LOGITS
the
0.67
all
0.63
really
0.58
requireNonNull
0.58
truly
0.57
a
0.54
an
0.54
pure
0.54
cosity
0.52
absolutely
0.51
Activations Density 0.127%