INDEX
Explanations
adjectives and phrases expressing opinions or evaluations
statements expressing serious concerns or issues
New Auto-Interp
Negative Logits
bats
-0.54
alks
-0.48
testers
-0.46
Languages
-0.46
keys
-0.46
slides
-0.45
bowls
-0.45
anas
-0.45
Talks
-0.44
keyboards
-0.44
POSITIVE LOGITS
soType
0.65
explan
0.63
disadvant
0.56
Beir
0.55
coincidence
0.54
ECA
0.53
indictment
0.52
shenan
0.52
brainer
0.50
gie
0.50
Activations Density 1.078%