INDEX
Explanations
references to online forums
references to forums or online discussion platforms
Negative Logits
lys        -0.71
Lilly      -0.65
nutrition  -0.64
birds      -0.64
efe        -0.63
meter      -0.61
ochond     -0.60
loo        -0.60
ectar      -0.59
kowski     -0.59
Positive Logits
forum        0.83
forums       0.83
postings     0.82
moderators   0.80
moderator    0.76
thread       0.73
discussions  0.72
posts        0.70
izen         0.70
nas          0.69
Activations Density 0.021%