Explanations
words and phrases that add nuance or a degree of uncertainty to a claim.
abstract concepts
Negative Logits
+#+#          -0.98
'\\;'         -0.82
démocratie    -0.80
bonté         -0.75
colère        -0.75
nahilalakip   -0.73
sagesse       -0.71
dignité       -0.70
république    -0.69
veille        -0.67
Positive Logits
situation     1.11
mixture       1.03
idea          0.96
combination   0.95
solution      0.94
approach      0.92
framework     0.90
timeframe     0.90
scheme        0.90
concept       0.89
Activations Density 10.196%