INDEX
Explanations
phrases indicative of the significance of relationships and personal connections
quantities like many or few
New Auto-Interp
Negative Logits
Anſ
-0.70
Monfieur
-0.70
myſelf
-0.69
Jefus
-0.69
ércoles
-0.67
المكان
-0.66
Theſe
-0.65
RSSSF
-0.64
themſelves
-0.64
himſelf
-0.64
POSITIVE LOGITS
many
0.97
few
0.96
few
0.82
those
0.81
wenigen
0.78
ujednoznacz
0.74
Few
0.72
several
0.71
many
0.70
FEW
0.70
Activations Density 0.077%