INDEX
Explanations
references to social status and personal insecurities related to one's appearance and circumstances
New Auto-Interp
Negative Logits
andra
-0.16
ύ
-0.16
anager
-0.15
StackNavigator
-0.15
amik
-0.15
icha
-0.14
имв
-0.14
Conce
-0.14
èIJ
-0.13
tober
-0.13
POSITIVE LOGITS
alongside
0.91
beside
0.85
next
0.77
next
0.65
neben
0.61
junto
0.57
vedle
0.54
.next
0.52
-next
0.51
Next
0.50
Activations Density 0.337%