INDEX
Explanations
details about people and places, particularly focusing on events and descriptions
references to specific people, places, and events
New Auto-Interp
Negative Logits
clusive
-0.68
animate
-0.64
>[
-0.61
ilion
-0.61
ĵ
-0.60
plete
-0.58
IFF
-0.57
ãĥij
-0.57
DRAG
-0.57
oppable
-0.56
POSITIVE LOGITS
agrees
1.44
testified
1.37
commented
1.36
remembers
1.36
tells
1.35
remarked
1.31
disagrees
1.31
believes
1.29
thinks
1.29
wrote
1.28
Activations Density 0.475%