INDEX
Explanations
pronouns referring to people or entities previously mentioned in the text
references to people or groups in relation to their actions or characteristics
New Auto-Interp
Negative Logits
AND
-0.71
onsequ
-0.64
Meaning
-0.62
ising
-0.62
PLIC
-0.60
Trop
-0.59
ont
-0.58
ounding
-0.57
istance
-0.56
Instruct
-0.56
POSITIVE LOGITS
owns
1.10
specializes
1.00
specialize
0.95
participated
0.92
loves
0.88
understands
0.87
possesses
0.86
enjoys
0.85
cares
0.84
participates
0.84
Activations Density 0.118%