INDEX
Explanations
phrases that indicate possession or relation involving "of."
New Auto-Interp
Negative Logits
Theſe
-0.77
Rüyada
-0.76
pleaſure
-0.76
متعلقه
-0.75
houſe
-0.73
Efq
-0.73
})`
-0.72
});*/
-0.70
})*/
-0.70
itſelf
-0.70
POSITIVE LOGITS
halfway
0.80
outside
0.77
across
0.71
before
0.71
inside
0.69
outside
0.68
near
0.66
around
0.65
Ende
0.64
середине
0.64
Activations Density 0.647%