INDEX
Explanations
mentions of friends or feelings of attraction
words related to friendships and relationships
New Auto-Interp
Negative Logits
close
-1.70
closer
-1.65
close
-1.48
Close
-1.46
closer
-1.40
closest
-1.39
Closer
-1.32
Close
-1.32
CLOSE
-1.30
CLOSE
-1.13
POSITIVE LOGITS
فريبيس
0.71
photolibrary
0.66
ſelves
0.63
CreateTagHelper
0.59
Мексичка
0.59
säll
0.58
Pilate
0.58
Efq
0.57
ároz
0.57
ocities
0.56
Activations Density 0.833%