INDEX
Explanations
references to familial relationships and connections
Negative Logits
Token            Logit
Member           -0.45
مورد             -0.42
grandchild       -0.42
glą              -0.40
the              -0.40
lle              -0.40
member           -0.39
grandchildren    -0.38
さま             -0.38
MEMBER           -0.38
Positive Logits
Token            Logit
تقاوى            1.04
betweenstory     0.95
Савезне          0.89
abestanden       0.89
(blank)          0.88
EndContext       0.87
RTEE             0.87
ImageContext     0.85
Portale          0.83
exitRule         0.83
Activations Density 0.124%