INDEX
Explanations
actions related to helping others and community engagement
Preceding verbs related to giving/telling
their and themselves
Negative Logits
them       -2.00
them       -1.69
THEM       -1.32
ándolos    -1.16
них        -1.06
آنها       -1.01
őket       -0.97
آنها       -0.97
ними       -0.94
вони       -0.92
Positive Logits
their         2.15
themselves    2.10
Their         1.93
their         1.86
themselves    1.86
Their         1.78
ihre          1.63
ihren         1.60
THEIR         1.44
leurs         1.34
Activations Density: 0.918%