INDEX
Explanations
punctuation marks indicating lists or enumerations
New Auto-Interp
Negative Logits
Pron
-0.54
Darfur
-0.52
rheumat
-0.52
vnode
-0.51
erat
-0.51
incest
-0.51
Boko
-0.50
Laredo
-0.49
needful
-0.49
Fleetwood
-0.48
POSITIVE LOGITS
}))
0.86
}));
0.83
)";
0.82
'>
0.81
)");
0.80
]));
0.78
"});
0.75
]),
0.73
/>";
0.73
")));
0.72
Activations Density 0.519%