INDEX
Explanations
personal relationships or interactions between individuals
references to specific people and relationships
New Auto-Interp
Negative Logits
Ult
-0.77
Turing
-0.76
DoS
-0.76
RFC
-0.75
ãĥĵ
-0.74
hack
-0.73
Forge
-0.71
Chain
-0.71
ãĥķãĤ©
-0.71
Scale
-0.70
POSITIVE LOGITS
woman
1.21
girl
1.19
daughter
1.13
Females
1.13
female
1.07
females
1.06
wife
1.04
lady
1.03
person
1.03
women
1.03
Activations Density 0.474%