INDEX
Explanations
references to forums and theatrical terms
Negative Logits
momix           -0.54
înc             -0.52
Grüsse          -0.51
хьтан           -0.50
îna             -0.50
дописавши       -0.50
perfección      -0.49
matchCondition  -0.49
betrek          -0.48
portata         -0.47
Positive Logits
forum     0.70
theatre   0.66
theater   0.60
FORUM     0.56
cinema    0.54
Forum     0.53
forum     0.52
Theatre   0.52
Q         0.52
Columns   0.51
Activations Density: 0.190%