Explanations
subjective statements reflecting personal opinions or experiences
Negative Logits
Token         Logit
istar         -0.16
udder         -0.14
ugal          -0.14
vlastnÄĽ      -0.13
ourcem        -0.13
_bases        -0.13
bä            -0.13
642           -0.13
>(()          -0.13
InstanceOf    -0.12
Positive Logits
Token         Logit
certainly     0.70
definitely    0.57
Certainly     0.54
surely        0.50
def           0.48
sure          0.48
Definitely    0.43
def           0.41
CERT          0.40
sure          0.40
Activations Density: 0.588%