INDEX
Explanations
information related to online forums and discussions
New Auto-Interp
Negative Logits
IDENT
-0.68
horizont
-0.64
behavi
-0.64
ITNESS
-0.62
creat
-0.62
hist
-0.62
ãĥ¼ãĥĨ
-0.62
negie
-0.62
traged
-0.61
nep
-0.61
POSITIVE LOGITS
################################
0.89
Edited
0.86
########
0.73
ãħĭ
0.72
Posts
0.72
Bought
0.69
eals
0.68
âĨij
0.68
reply
0.68
region
0.67
Activations Density 3.499%