INDEX
Explanations
instances of verbs indicating involvement or states of being, particularly in the context of community engagement and personal interests
New Auto-Interp
Negative Logits
ÑĦи
-0.15
arc
-0.15
overlooked
-0.14
ARC
-0.14
neglected
-0.14
ÑĦÑĸ
-0.14
gren
-0.13
segregated
-0.13
aa
-0.13
лÑĥги
-0.13
POSITIVE LOGITS
YRO
0.17
able
0.17
Fauc
0.15
IAM
0.14
iam
0.14
entanyl
0.14
kin
0.14
ÅĻiv
0.14
aze
0.14
runner
0.14
Activations Density 0.290%