INDEX
Explanations
words related to policies, regulations, and political actions
statements that reference the effects and roles of something significant, often using the word "it."
New Auto-Interp
Negative Logits
ombo
-0.72
IFE
-0.72
english
-0.66
onica
-0.66
uga
-0.65
itu
-0.65
agon
-0.64
isphere
-0.63
Albion
-0.62
ibble
-0.62
POSITIVE LOGITS
also
0.97
encourage
0.91
encourages
0.89
allow
0.86
additionally
0.86
eliminate
0.85
instruct
0.83
facilitate
0.83
furthermore
0.83
thereby
0.82
Activations Density 0.489%