Explanations
references to comparisons and relationships among different groups or entities
Negative Logits
SequentialGroup    -0.51
snapshot           -0.49
#%                 -0.48
I                  -0.48
D                  -0.47
Koala              -0.45
Prow               -0.45
ValueStyle         -0.45
regia              -0.45
шру                -0.44
Positive Logits
colleagues         1.05
colleague          1.03
colega             0.90
colegas            0.90
counterparts       0.90
rekan              0.88
fellow             0.88
Colleagues         0.86
collègues          0.84
predecessors       0.84
Activations Density 0.187%