INDEX
Explanations
words and phrases related to introducing oneself or presenting information
references to self-presentation and identity
New Auto-Interp
Negative Logits
ruce
-0.66
erest
-0.61
sterdam
-0.60
iking
-0.58
Herb
-0.57
ciples
-0.56
ï¸ı
-0.55
Tradable
-0.55
pps
-0.54
enfranch
-0.53
POSITIVE LOGITS
ICAN
0.73
predicament
0.71
ional
0.70
æ³
0.70
thin
0.66
threat
0.66
grievances
0.66
dilemma
0.66
ISC
0.64
ographics
0.63
Activations Density 0.127%