INDEX
Explanations
pronouns and their associated actions or states in plural and singular contexts
New Auto-Interp
Negative Logits
the
-0.48
\
-0.45
Re
-0.44
.
-0.43
-
-0.42
a
-0.42
↵↵
-0.41
*
-0.41
return
-0.41
ebb
-0.40
POSITIVE LOGITS
complexContent
1.02
ThroughAttribute
1.01
tvguidetime
1.00
TagMode
1.00
)_/¯
0.98
nakalista
0.98
openzeppelin
0.97
متعلقه
0.96
devamını
0.95
فريبيس
0.94
Activations Density 0.287%