INDEX
Explanations
concepts related to fact-checking and verification
phrases that discuss the concept of "fact" and its implications in various contexts, often in a critical or analytical manner
New Auto-Interp
Negative Logits
Receiver
-0.73
Passenger
-0.72
payroll
-0.70
exit
-0.67
Seasons
-0.67
imports
-0.67
receipts
-0.66
refunds
-0.66
wait
-0.66
Hai
-0.65
POSITIVE LOGITS
oriented
1.70
driven
1.62
laden
1.56
based
1.55
focused
1.51
filled
1.50
abiding
1.48
packed
1.48
conscious
1.46
loving
1.46
Activations Density 0.069%