INDEX
Explanations
phrases related to providing information or explanations, possibly in a formal or technical context
phrases related to significant events or actions involving individuals or groups
Negative Logits
aven         -0.72
agriculture  -0.71
gren         -0.71
masked       -0.70
forestry     -0.70
net          -0.69
societies    -0.69
resent       -0.69
istration    -0.69
infl         -0.69
Positive Logits
Own           1.20
Affect        1.14
Relationship  1.14
Approach      1.06
Appro         1.05
Course        1.05
Guys          1.04
Clause        1.02
Character     1.02
Stupid        1.02
Activations Density 0.541%