INDEX
Explanations
mentions of entertaining or entertainment
references to entertainment or the act of entertaining
New Auto-Interp
Negative Logits
imil
-0.66
yer
-0.66
mined
-0.61
descent
-0.61
barren
-0.60
jamin
-0.59
prone
-0.59
sil
-0.59
orah
-0.58
installed
-0.58
POSITIVE LOGITS
tainment
1.13
entertain
1.07
entertained
0.99
glers
0.90
entert
0.81
esc
0.81
ments
0.80
TextColor
0.78
eering
0.77
issance
0.77
Activations Density 0.017%