Explanations
references to friendship and companionship
Negative Logits
senal        -0.73
millenn      -0.72
chloride     -0.69
Cout         -0.64
ilion        -0.63
teen         -0.61
untreated    -0.61
nsic         -0.61
elsius       -0.60
etheus       -0.60
Positive Logits
liest        1.68
liness       1.66
lier         1.46
lies         1.41
ship         1.21
ships        1.20
finder       0.90
strom        0.82
hips         0.81
friend       0.80
Activations Density: 0.023%