INDEX
Explanations
phrases related to thoughts, ideas, and opinions
expressions of thoughts or opinions about various subjects
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.75
ãĤ¬
-0.72
ãĤ´ãĥ³
-0.70
ãĤº
-0.68
contention
-0.67
WAR
-0.66
Nationwide
-0.66
adra
-0.64
OURCE
-0.63
Regions
-0.62
POSITIVE LOGITS
'd
1.00
might
0.99
sounded
0.92
prudent
0.87
ought
0.84
joking
0.83
odd
0.82
might
0.82
funny
0.79
hilarious
0.77
Activations Density 0.112%