INDEX
Explanations
references to family relationships and social gatherings
New Auto-Interp
Negative Logits
#
-0.69
vectorielles
-0.63
للمعارف
-0.62
-0.62
Operator
-0.61
Baillargeon
-0.61
mattino
-0.60
MotionEvent
-0.59
complémentaires
-0.58
espagne
-0.57
POSITIVE LOGITS
friends
0.75
friend
0.68
relatives
0.65
uncles
0.64
Uncle
0.64
family
0.63
uncle
0.63
cousin
0.63
aunts
0.62
social
0.62
Activations Density 0.264%