INDEX
Explanations
names and mentions of celebrities
New Auto-Interp
Negative Logits
transQ
-0.47
heça
-0.44
exposiciones
-0.43
invokingState
-0.41
journées
-0.41
progrès
-0.39
práctico
-0.39
mandiri
-0.38
capucha
-0.38
ouvert
-0.38
POSITIVE LOGITS
celebrity
0.73
celebrities
0.73
celebs
0.69
superstar
0.65
celebrity
0.65
يتيمه
0.63
BeginContext
0.63
superstars
0.60
فريبيس
0.57
Celebrity
0.57
Activations Density 0.334%