INDEX
Explanations
phrases indicating receipt or acknowledgment of something positive or valuable
Negative Logits
itſelf        -1.04
myſelf        -0.97
fubject       -0.92
ſelf          -0.89
themſelves    -0.87
ſtate         -0.87
―――――         -0.81
purpoſe       -0.79
Reſ           -0.78
ſelves        -0.77
Positive Logits
gets          0.96
get           0.95
received      0.91
got           0.91
receives      0.89
fikk          0.88
Gets          0.87
kregen        0.85
receive       0.85
får           0.84
Activations Density: 0.297%