INDEX
Explanations
expressions of support and mutual reliance among individuals in a community
Reciprocal actions or relationships
reciprocal actions
New Auto-Interp
Negative Logits
ьте
-0.49
M
-0.48
I
-0.47
<eos>
-0.47
-0.47
T
-0.46
ios
-0.43
CONT
-0.43
-0.43
(
-0.42
POSITIVE LOGITS
mutual
1.55
mutual
1.41
mutually
1.38
nhau
1.29
eachother
1.29
saling
1.29
お互
1.25
Mutual
1.23
einander
1.22
друг
1.18
Activations Density 0.355%