If Artificial General Intelligence has an okay outcome, what will be the reason?

Plus

517

Ṁ420k

2200

16%

Humanity coordinates to prevent the creation of potentially-unsafe AIs.

Alignment is not properly solved, but core human values are simple enough that partial alignment techniques can impart these robustly. Despite caring about other things, it is relatively cheap for AGI to satisfy human values.

AIs will not have utility functions (in the same sense that humans do not), their goals such as they are will be relatively humanlike, and they will be "computerish" and generally weakly motivated compared to humans.

Yudkowsky is trying to solve the wrong problem using the wrong methods based on a wrong model of the world derived from poor thinking and fortunately all of his mistakes have failed to cancel out

We create a truth economy. https://manifold.markets/Krantz/is-establishing-a-truth-economy-tha?r=S3JhbnR6

Someone solves agent foundations

Other

Eliezer finally listens to Krantz.

Development and deployment of advanced AI occurs within a secure enclave which can only be interfaced with via a decentralized governance protocol

Human consciousness is needed to collapse wave function, and AI can't do it. Thus humans should be preserved and they may require complete friendliness in exchange (or they will be unhappy and produce bad collapses)

Ethics turns out to be a precondition of superintelligence

For some reason, the optimal strategy for AGIs is just to head somewhere with far more resources than Earth, as fast as possible. All unaligned AGIs immediately leave, and, for some reason, do not leave anything behind that kills us.

Someone creates AGI(s) in a box, and offers to split the universe. They somehow find a way to arrange this so that the AGI(s) cannot manipulate them or pull any tricks, and the AGI(s) give them instructions for safe pivotal acts.

1.6%

Nick Bostrom's idea (Hail Mary) that AI will preserve humans to trade with possible aliens works

1.1%

Moral Realism is true, the AI discovers this and the One True Morality is human-compatible.

1.1%

The response to AI advancements or failures makes some governments delay the timelines

Nanotech is difficult without experiments, so no mail order AI Grey Goo; Humans will be the main workhorse of AI everywhere. While they will be exploited, this will be like normal life from inside

Someone at least moderately sane leads a campaign, becomes in charge of a major nation, and starts a secret project with enough resources to solve alignment, because it turns out there's a way to convert resources into alignment progress.

AGI is never built (indefinite global moratorium)

We make risk-conservative requests to extract alignment-related work out of AI-systems that were boxed prior to becoming superhuman. We somehow manage to achieve a positive feedback-loop in alignment/verification-abilities.

Duplicate of https://manifold.markets/EliezerYudkowsky/if-artificial-general-intelligence with user-submitted answers. An outcome is "okay" if it gets at least 20% of the maximum attainable cosmopolitan value that could've been attained by a positive Singularity (a la full Coherent Extrapolated Volition done correctly), and existing humans don't suffer death or any other awful fates.

This question is managed and resolved by Manifold.

#AI

#Fun

Get

1,000

and

3.00

25 Comments

489 Holders

7k Trades

Sort by:

bought Ṁ150 YES

Voted based on my research, summarized here: https://blog.ideanexusventures.com/the-conscious-economy/

bought Ṁ50 NO

That seems like a lot of fancy neologisms to say “we should use AI to automate tedious things like paperwork”, and I don’t see what it has to do with the quantum mechanics aspect of the market you bet on.

@Kronopath how did I get in on this at 1.8% to 9% and now the orderbook looks like this? lol insane this was ever so low.

sold Ṁ407 NO

@EliezerYudkowsky I really think it should be more like 0.001% (10^-24%?) of the "maximum attainable cosmopolitan value that could've been attained by a positive Singularity (a la full Coherent Extrapolated Volition done correctly)".

bought Ṁ10 YES

An outcome is "okay" if it gets at least 20% of the maximum attainable cosmopolitan value that could've been attained by a positive Singularity (a la full Coherent Extrapolated Volition done correctly), and existing humans don't suffer death or any other awful fates.

Tons of unimaginably amazing, extremely good futures don't qualify as "okay" by this definition, hmm.

What exactly is the plan to resolve the multiple non-contradictory resolution criteria? Will there be some kind of "weighted according to my gut feeling of how important they are"? Will they all resolve "yes"? Or is it "I will pick the one that was most centrally true"?

It would be nice if there was some kind of flow-chart for resolution like in my "if AI causes human extinction" market.

I've blocked Krantz, which I don't know whether it prevents him from creating new answers. I don't seem to have the ability to resolve the current answers N/A, and would hesitate to resolve "No" under the circumstances unless a mod okays that.

@EliezerYudkowsky

I don't seem to have the ability to resolve the current answers N/A, and would hesitate to resolve "No" under the circumstances unless a mod okays that.

Unfortunately this is a dependent multiple choice market, so all options have to resolve (summing to 100% or N/A) at the same time. So it's not a question of whether that's ok with mods, it simply isn't possible given the market structure.

It's a not uncommon issue that popular dependent MC markets get many unwanted answers added. It would be great if there were better tools to control this, but unfortunately the options are pretty blunt. My personal recommendation (but totally up to you) would be to change the market settings so that only the creator can add answers---then, people can make suggestions in the comments, and you can choose whether to include them or not. (I can make that change to the settings if you'd prefer, but it's under the 3 dots for more market options).

You can also feel free to edit any unwanted answers to just say "N/A" or "Ignore" or etc, to partially clean up the market (& clarify where attention should go). That's very much within your right as creator. But there's no way to actually remove the options (or resolve them early, although they will quickly go to ~0% with natural betting).

@EliezerYudkowsky If it's not too much of a hassle, would you also consider making an unlinked version of this market with the most promising options copied over, so that the non mutually exclusive options don't distort each others' probabilities? I know I could do this myself if necessary but your influence brings vastly more attention to the market and this seems like a fairly important market question. Maybe the wording would need to be very slightly altered to "...what will be true of the reason?"

@EliezerYudkowsky Least hassle approach: Start with "Duplicate" in the menu…

…then "Choose question type"…

…choose "Set" instead…

…delete the answers you don't want to keep. (When I tested, the answers carried over.)

@EliezerYudkowsky An alternative to N/A-ing this entire market would be to unlist it:

…in response to @TheAllMemeingEye's concern that "[this market] makes the site look bad being promoted so high on the home page".

bought Ṁ10 NO

@4fa superb advice :) I didn't realise it was that easy lol

@EliezerYudkowsky I would recommend to just edit all of Krantz’s options to [Resolves No]

Bafflingly, @EliezerYudkowsky appears to be the (distant) second-biggest Yes holder on Krantz’s options. I’m not sure how that happened. (Some kind of auto-betting from betting on “Other” or something?)

@Kronopath When one holds YES shares in 'Other', one is awarded that number of YES shares in any subsequently added options.

@Kronopath In addition to what jim explained, you can also see that it says "Spent Ṁ0".

Despite being blocked, he's back again throwing mana at his own options, ffs. I am in favor of editing all of Krantz’s options to [Resolves No].

@Krantz This was too long to fit.

Enough people understand that we can control a decentralize GOFAI by using a decentralized constitution that is embedded into a free and open market that sovereign individuals can earn a living by aligning. Peace and sanity is achieved game theoretically by making the decentralized process that interpretably advances alignment the same process we use to create new decentralized money. We create an economy that properly rewards the production of valuable alignment data and it feels a lot like a school that pays people to check each other's homework. It is a mechanism that empowers people to earn a living by doing alignment work decentrally in the public domain. This enables us to learn the second bitter lesson: "We needed to be collecting a particular class of data, specifically confidence and attention intervals for propositions (and logical connections of propositions) within a constitution.".

If we radically accelerated the collection of this data by incentivizing it's growth monetarily in a way that empowers poor people to become deeply educated, we might just survive this.

@Krantz you forgot to mention the sexual component.

@Krantz Given what's going on with decentralized money, this is a comparison that predicts failure.

The fact that the Krantz stuff is #2 and #3 here and not something like "one of OpenAI/Anthropic/DeepMind solves the alignment problem" indicates a complete market failure.

bought Ṁ50 YES

@LoganZoellner Maybe you should correct the market. I've got plenty of limit orders to be filled.

@LoganZoellner personally I would actually support total N/A at this point given the nonsensical nature of a linked market with non mutually exclusive options, it makes the site look bad being promoted so high on the home page

@Krantz

>Maybe you should correct the market. I've got plenty of limit orders to be filled.

Given this market appears completely nonsensical, I have absolutely 0 faith that my ability to stay liquid will outlast this market's ability to be irrational.

I have had bad luck in the past with investing in markets where the outcome criteria was basically "the author will choose one of these at random at a future date".

Also, note that this market isn't monetized, so even though I'm 99.9999999999% sure that neither of those options will resolve positively, there isn't actually any way for me to profit off that information.

bought Ṁ50 YES

A friend made a two video series about it, he is pretty smart and convinced me that AI fear is kind of misguided

https://youtu.be/RbMWIzJEeaQ?si=asqn6uadLXPpeDjJ

Related questions

Related questions