INDEX
Explanations
verbs that convey actions or processes related to communication and interaction
Negative Logits
ſtate          -0.67
fubject        -0.63
Chriftian      -0.59
ftate          -0.58
raiſ           -0.57
houſe          -0.56
pleaſure       -0.55
ſever          -0.55
fevere         -0.54
ujednoznacz    -0.54
Positive Logits
setVerticalGroup    0.60
ches                0.59
es                  0.56
icks                0.56
otes                0.56
tifies              0.55
kes                 0.54
odes                0.53
ains                0.53
itself              0.52
Activations Density: 0.560%