INDEX
Explanations
questions or doubts about various topics or situations
phrases that express skepticism or inquiry about topics
New Auto-Interp
Negative Logits
idae
-0.80
ymph
-0.73
--+
-0.72
////////
-0.69
ê
-0.69
istg
-0.68
++;
-0.68
ï¸
-0.66
////////////////
-0.66
Interstitial
-0.66
POSITIVE LOGITS
validity
1.24
legality
1.23
legitimacy
1.12
motives
1.07
authenticity
1.07
assumptions
1.02
eligibility
0.99
propri
0.99
morality
0.96
credibility
0.96
Activations Density 0.189%