INDEX
Explanations
mentions of the word "carnival" or its variations
New Auto-Interp
Negative Logits
engu
-0.17
uyu
-0.16
Sher
-0.16
iras
-0.15
REP
-0.15
ersen
-0.15
\brief
-0.14
Latch
-0.14
Ĺı
-0.14
advertisement
-0.14
POSITIVE LOGITS
egie
0.26
ivals
0.25
aval
0.21
oust
0.20
ival
0.20
age
0.19
IVAL
0.19
elian
0.18
forth
0.17
al
0.17
Activations Density 0.006%