INDEX
Explanations
mentions of cognitive states or cognitive abilities
terminology related to cognitive functions or mental faculties
New Auto-Interp
Negative Logits
ItemTracker
-0.77
nesses
-0.74
found
-0.70
emale
-0.68
ngth
-0.68
upid
-0.66
Soc
-0.66
cur
-0.65
satisf
-0.65
pleasant
-0.65
POSITIVE LOGITS
administration
1.36
Administration
1.05
Ĥİ
0.87
admin
0.77
ä½ľ
0.75
Cheong
0.72
aders
0.69
Admin
0.68
axis
0.66
ains
0.64
Activations Density 0.000%