INDEX
Explanations
user profiles or social media user information
occurrences of the word "Profile."
Negative Logits
abeth        -0.86
atorial      -0.71
chest        -0.70
actic        -0.70
RECT         -0.70
iah          -0.68
ago          -0.67
erc          -0.66
mediately    -0.66
meaning      -0.65
Positive Logits
Profile      1.19
Joined       1.00
profiles     0.94
profile      0.85
allery       0.84
Blog         0.83
Features     0.81
Occupations  0.78
Artist       0.73
Quote        0.73
Activations Density 0.009%