INDEX
Explanations
references to online discussion forums
mentions of online discussion platforms or community forums
New Auto-Interp
Negative Logits
othes
-0.69
Marg
-0.68
osing
-0.68
Flags
-0.67
lev
-0.66
obal
-0.65
ocr
-0.62
oses
-0.61
OSE
-0.60
Dmit
-0.60
POSITIVE LOGITS
forum
1.34
forums
1.30
Forums
0.96
forum
0.90
moderators
0.89
thread
0.85
forums
0.85
discussions
0.83
moderator
0.83
postings
0.79
Activations Density 0.005%