Explanations
social media and online integration-related terms, along with technological features related to social platforms
Negative Logits
Token        Logit
Directions   -0.72
urban        -0.67
vich         -0.65
sbm          -0.64
WATCHED      -0.63
thia         -0.62
pard         -0.61
Males        -0.61
trailing     -0.59
followed     -0.59
Positive Logits
Token        Logit
brink        1.01
heights      1.00
levels       0.98
notch        0.97
level        0.90
peak         0.88
highs        0.88
extremes     0.85
saturation   0.85
threshold    0.85
Activations Density: 0.296%