INDEX
Explanations
adjectives used to describe different aspects of a situation, such as fairness, controversy, danger, and innovation
adjectives that describe intensity or severity in various contexts
New Auto-Interp
Negative Logits
adelphia
-0.82
©¶æ
-0.75
aters
-0.73
culus
-0.72
tu
-0.69
lance
-0.69
Joy
-0.68
ipers
-0.68
iets
-0.67
urtles
-0.67
POSITIVE LOGITS
alike
1.11
truths
0.75
striped
0.74
views
0.73
combinations
0.73
perspectives
0.73
opinions
0.71
ifiable
0.71
viewpoints
0.70
respectively
0.70
Activations Density 0.207%