INDEX
Explanations
repeated phrases or patterns emphasizing statements
New Auto-Interp
Negative Logits
الدراسه
-0.92
cdti
-0.66
wikipagina
-0.64
Managua
-0.60
RegisterType
-0.60
*/;
-0.59
CMV
-0.59
ocino
-0.58
singleton
-0.58
rungsseite
-0.58
POSITIVE LOGITS
Thats
0.88
Thats
0.87
thats
0.79
why
0.74
thats
0.71
That
0.71
这就是
0.70
That
0.63
isOk
0.63
這就是
0.63
Activations Density 0.112%