INDEX
Explanations
punctuation marks, particularly quotes and parentheses, indicating dialogue or citations
New Auto-Interp
Negative Logits
+#+#
-0.69
/</
-0.53
estekak
-0.51
/
-0.50
kam
-0.49
/−
-0.48
driver
-0.48
};*/
-0.48
film
-0.47
Дорогие
-0.47
POSITIVE LOGITS
Roskov
0.69
stanovnika
0.65
trypsin
0.64
"...
0.60
buckwheat
0.59
unittest
0.58
Бахар
0.57
linkovi
0.57
Vikipedi
0.57
endblock
0.57
Activations Density 0.246%