INDEX
Explanations
references to media and entertainment, particularly regarding celebrity news or gossip
celebrity news and entertainment
New Auto-Interp
Negative Logits
kháu
-0.61
estekak
-0.59
فريبيس
-0.56
onCreateView
-0.53
hyrchwyd
-0.53
الرياضيه
-0.50
الاطلاع
-0.50
AxisAlignment
-0.50
חיצוניים
-0.50
DoubleQuotes
-0.49
POSITIVE LOGITS
ünl
0.34
<bos>
0.34
antaranya
0.33
wakili
0.33
entertainment
0.32
gossip
0.32
amizade
0.32
promoção
0.31
voici
0.30
aparência
0.30
Activations Density 0.053%