INDEX
Explanations
possessive forms indicating ownership or attributes
New Auto-Interp
Negative Logits
»
-0.80
lehem
-0.79
Õ
-0.79
Ú
-0.77
ENN
-0.76
Balt
-0.75
ÙIJ
-0.74
cible
-0.73
Ïī
-0.72
BT
-0.72
POSITIVE LOGITS
biggest
1.03
penchant
0.99
inability
0.99
detractors
0.97
spokesman
0.95
obsession
0.93
newest
0.92
refusal
0.91
motives
0.91
actions
0.90
Activations Density 0.130%