
Microsoft launches ‘vibe working’ in Excel and Word

115 likes

Submitted 4 days ago by TheBat@lemmy.world to technology@lemmy.world

https://www.theverge.com/news/787076/microsoft-office-agent-mode-office-agent-anthropic-models

Comments

  • InEnduringGrowStrong@sh.itjust.works ⁨4⁩ ⁨days⁩ ago

    Microsoft says its Agent Mode in Excel has an accuracy rate of 57.2 percent in SpreadsheetBench, a benchmark for evaluating an AI model’s ability to edit real world spreadsheets.

    It generates 42.8% bullshit.

    • jubilationtcornpone@sh.itjust.works ⁨4⁩ ⁨days⁩ ago

      They probably view that as a statistic worth bragging about. It’s not. If Excel got calculations right 57.2% of the time it would be completely worthless.

      • PerogiBoi@lemmy.ca ⁨4⁩ ⁨days⁩ ago

        I asked Copilot to look through all of my spreadsheets and count how many times a category occurred. I was curious to see if it was any good. It gave me 2 different numbers. Neither was correct.

      • FreedomAdvocate@lemmy.net.au ⁨4⁩ ⁨days⁩ ago

        Did you read the next sentence? Humans only get like 72% right. It’s not far off at all.

    • MadMadBunny@lemmy.ca ⁨4⁩ ⁨days⁩ ago

      So it achieved the actual proficiency of a middle manager…

      • MonkderVierte@lemmy.zip ⁨4⁩ ⁨days⁩ ago

        Decades ago. The company that replaced its CEO with an LLM thrives.

    • potoo22@programming.dev ⁨4⁩ ⁨days⁩ ago

      Just keep regenerating data until it’s something the stockholders like. Doesn’t matter if it’s BS. They’re already accustomed to that.

    • SkaveRat@discuss.tchncs.de ⁨4⁩ ⁨days⁩ ago

      Nice. Basically a coin flip

      • GasMaskedLunatic@lemmy.dbzer0.com ⁨4⁩ ⁨days⁩ ago

        Slightly better than Vegas. Unfortunately, plenty of people are okay with Vegas odds.

    • sparky@lemmy.federate.cc ⁨3⁩ ⁨days⁩ ago

      defector.com/it-took-many-years-and-billions-of-d…

    • Imgonnatrythis@sh.itjust.works ⁨4⁩ ⁨days⁩ ago

      Not enough accuracy to be useful. Not enough bullshit for politics.

  • Fyrnyx@kbin.melroy.org ⁨4⁩ ⁨days⁩ ago

    Oh it's going to do it for Word too?

    Prompt: Termination letter telling my boss and bosses to kindly go fuck themselves and make it professional

    • calliope@retrolemmy.com ⁨4⁩ ⁨days⁩ ago

      The best you can do in any job is to care as little about them as they care about you.

      They will barely read it, and they won’t care nearly as much as you do.

      I resign my position as a [position], effective [DATE].

  • MonkderVierte@lemmy.zip ⁨4⁩ ⁨days⁩ ago

    The best cancers of both worlds.

  • supersquirrel@sopuli.xyz ⁨4⁩ ⁨days⁩ ago

    So let me fast-forward a bit -> underpaid, stressed-out tech workers in the Global South pretending to be AI for incompetent upper management in wealthy countries?

    • Fizz@lemmy.nz ⁨3⁩ ⁨days⁩ ago

      Not related, but does Global South refer to south of the equator or just everything south of North America?

      • supersquirrel@sopuli.xyz ⁨3⁩ ⁨days⁩ ago

        I don’t know if it is a perfect term, but it doesn’t literally refer to any specific “South”. Rather, I think it is a reference to the coincidence that many of the heavily industrialized empires of the 18th, 19th and 20th centuries were in the northern hemisphere, and the colonial power dynamic they set up has led to the term “Global South” meaning pretty much anywhere that has gotten the short end of the colonialism stick, vs. the long end.

  • kramer@slevin.horse ⁨4⁩ ⁨days⁩ ago

    They’re outsmarting the sheet, that’s for sure.

  • FreedomAdvocate@lemmy.net.au ⁨4⁩ ⁨days⁩ ago

    Excel is one place where AI makes sense. All the data is there, in a nice structured and typed format with headings etc.

    • Olap@lemmy.world ⁨4⁩ ⁨days⁩ ago

      LLMs can’t count. Can’t add. Can’t deal with actually large datasets.

      How is Excel a good fit for vibe-coding?

      • FreedomAdvocate@lemmy.net.au ⁨3⁩ ⁨days⁩ ago

        This isn’t just an LLM. It uses Excel functions and features to do the counting, the adding, and the handling of large data sets.

        It’s not “vibe coding” so much as “vibe performing steps in Excel”.

    • bigfondue@lemmy.world ⁨4⁩ ⁨days⁩ ago

      Until it starts pulling data from nonexistent worksheets

      • FreedomAdvocate@lemmy.net.au ⁨3⁩ ⁨days⁩ ago

        You tell it not to.

        I swear none of you guys have even attempted to use AI to do data analysis. I have. I built an MCP and integrated a Copilot agent into Teams that has access to specific database data, and refined its rules to the point where the CFO rigorously tested it (and still does) and trusts the results it returns.

    • jacksilver@lemmy.world ⁨4⁩ ⁨days⁩ ago

      It could be good to layer in standard machine learning (ML), and Excel already does have some such features (like line of best fit).

      However, in today’s context AI means LLMs, and those are not a good fit because of their unpredictability.
