Explanations
collaborate and participate
Negative Logits
panes         0.51
veggies       0.50
delimiters    0.50
grasses       0.50
ripples       0.48
herbivores    0.48
semicircle    0.46
rappers       0.45
markers       0.44
bolas         0.43
Positive Logits
collaborate      0.63
collaborates     0.56
collaborating    0.54
collaborated     0.54
Participate      0.53
participate      0.52
регулярно        0.51
participated     0.50
Particip         0.50
particip         0.50
Activations Density 0.058%