INDEX
Explanations
terms related to different types of objects or metaphorical concepts
metaphorical language related to animals and objects used to illustrate concepts or situations
Negative Logits
AMI        -0.63
VICE       -0.63
Dresden    -0.58
rity       -0.58
Ability    -0.58
degraded   -0.58
NZ         -0.58
kson       -0.55
Recre      -0.55
HCR        -0.55
Positive Logits
bowl        0.79
fide        0.78
headed      0.78
neck        0.76
bag         0.76
pit         0.73
tight       0.73
wagon       0.73
wagon       0.69
cake        0.69
Activations Density 0.491%
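
As a rough illustration of how a density figure like the one above can be read, the sketch below treats activation density as the fraction of token positions on which the feature fires. This definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions made for illustration; the page does not say how its value was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of entries strictly greater than `threshold`.

    Assumption: a feature counts as "active" at a token position when its
    activation exceeds the threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, 5 of which have a nonzero activation.
sample = [0.0] * 995 + [0.3, 1.2, 0.7, 2.1, 0.05]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.491% would mean the feature is active on roughly 5 of every 1000 tokens in whatever corpus was sampled.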