INDEX
Explanations
expressions of community and connection among individuals
New Auto-Interp
Negative Logits
lope
-0.16
himself
-0.15
irl
-0.15
itself
-0.15
ifer
-0.14
cente
-0.14
overall
-0.14
Tobias
-0.13
ramer
-0.13
ini
-0.13
POSITIVE LOGITS
/forum
0.15
Eigen
0.15
Assignable
0.14
گاÙĨ
0.14
dio
0.14
tron
0.14
raç
0.14
exion
0.14
gens
0.14
.INPUT
0.14
Activations Density 0.033%