My old man has a bunch of .dox stuff saved. He has complicated large files saved that are not supported by any of the FOSS conversion tools. I’ve tried Libre office, Abi Word, and every command line tool and converter I can find. These are entire book sized files.

I have a W10 machine with Word. Is extracting the .exe and running it with wine feasible without making an epic mess or massive project of this?

  • Presi300@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    17 days ago

    Generally, no. M$ office has some pretty invasive DRM, so your best bet to running it on linux is to run it on a windows virtual machine

  • Tippon@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    0
    ·
    21 days ago

    Have you tried the online version of MS Office? I’m not sure, but I think there’s a free version. Depending on the file, you might be able to convert it to another format, then use a FOSS tool going forwards.

        • j4k3@lemmy.worldOP
          link
          fedilink
          English
          arrow-up
          0
          ·
          21 days ago

          free is not as in freedom from stalkerware, and there is only a 1 month blank-check impossibscribe extortion type trial scam.

          • Grangle1@lemm.ee
            link
            fedilink
            arrow-up
            0
            ·
            21 days ago

            If you’re already thinking of extracting/attempting to run a desktop version of Office, you may as well save yourself the effort if you can and give the free online version a try. You’ll be using a proprietary piece of software either way.

            • j4k3@lemmy.worldOP
              link
              fedilink
              English
              arrow-up
              0
              ·
              21 days ago

              I’m looking at trying to convert a file type that Linux doesn’t presently have a capability of opening. What is this toxic nonsense? Seriously, I’m trying to get my ignorant old man on FOSS. I don’t give a * about m$. I also think evangelical nonsense is worse than m$, so curb that nonsense please. This isn’t some Arch forum.

    • neidu2@feddit.nl
      link
      fedilink
      arrow-up
      0
      ·
      21 days ago

      I was thinking along the same lines. Use the online version available via portal.office.com, and use that to convert everything to something more FOSS-friendly.

      Not sure if access is free, though.

      • Telorand@reddthat.com
        link
        fedilink
        arrow-up
        0
        ·
        21 days ago

        This is what I would recommend as well. Try to convert within Word to an older version or open version that’s likely to be compatible with other software. Test one and see if it converts okay.

  • just_another_person@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    21 days ago

    You can try Pandoc and see if that works, Google Docs, Office365, finding an abandonware version of Word and running on Wine…lots of options to work with.

    It might be easier to start narrowing down where you need to look if you get the header info from one of these files.

    • j4k3@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      0
      ·
      21 days ago

      I can’t even extract the header. It is coming up as a binary, but it is just mystery encoding shenanigans of m$ criminals failing to build to the iso standard, no doubt, to cause this very issue and attempt to justify infinite payments for a finite tool. The error is in position 22 according to iconv. With $ bat, it reports binary, Only Office won’t even display and fails to pop-up. Both AbiWord and Libre Office detect over a dozen Asian languages and freezes after ~1-2 min, but display. Python3 Docx can’t even load the ‘document = Document(’/tmp/test.docx’)` line.

      • just_another_person@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        21 days ago

        Okay. First off, I downvoted you for obvious reasons.

        Second, if you’re not sure how to extract the header of a file, just Google that. You may be ill prepared and asking for help here.

        • j4k3@lemmy.worldOP
          link
          fedilink
          English
          arrow-up
          0
          ·
          21 days ago

          You earned the following block. You make foolish assumptions. The header is garbled in Linux entirely there is nothing available to extract with any tools.

          • just_another_person@lemmy.world
            link
            fedilink
            arrow-up
            0
            ·
            21 days ago

            You don’t understand how file formats work I guess. You can’t just ‘head’ an encoded file and expect a terminal to output what you want. Do some research.

  • _edge@discuss.tchncs.de
    link
    fedilink
    arrow-up
    0
    ·
    21 days ago

    I wouldn’t even try with wine these days.

    Why don’t you use the Win10 machine you have, the online version of Microsoft Office (web browser or app), a VM with Windows, or (if it works for your case) Google Docs or OnlyOffice.

  • data1701d (He/Him)@startrek.website
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    21 days ago

    How old of Docx files are you talking? Something like Office 2010 might run quite well, and your father would have probably had to have used some very weird features for it to be incompatible.

    • j4k3@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      0
      ·
      21 days ago

      It’s recent stuff. He likes a Fedora machine, but the only issue is the inability to replicate his Word docs. At this point I don’t know if Libre tools will be easy enough for him. There us nothing really intelligent happening in these files. He is not capable of handling any kind of real world complexity. So everything is going to be whatever was made super simple and easy in m$ Word.

      I went through my old CD collection, but don’t have disks beyond Office for XP.

  • wizardbeard@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    0
    ·
    21 days ago

    Assuming the latest version of OpenOffice doesn’t work for these files…


    My next course of action would be using the Win 10 machine with Word, or a VM with Win10 or 11 and the latest version of Word. Use MASGrave to trick M$ into considering it licensed if you need to.

    Use a Powershell script to interact with Word through the COM object interface and automate opening Word, opening the file, saving it as a different filetype, and closing. Here’s a snippet of Powershell from Reddit for going in the opposite direction (odt to docx) for a single file. I wouldn’t try to do this through Linux, just suck it up and use Windows so you don’t have an extra layer of mess to deal with.

    Going off M$ documentation of the save types enum, I would replace “wdFormatDocumentDefault” in that snippet with wdFormatOpenDocumentText or wdFormatStrictOpenXMLDocument, then test it with a single file to see which gives the output you need.

    Getting all the files of the starting type from a folder can be done using Get-ChildItem. Store those in a variable and use a foreach loop over the initial file list.

    • j4k3@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      0
      ·
      21 days ago

      All of the office suites seem to use either Python 3 docx or iconvert under the surface. These tools do not support whatever default encoding m$ is using. It is clearly a font encoding issue, but I won’t know what that font is until my back is in good enough shape to setup a desktop at my bedside workstation.

  • I thought we stopped doing the “m$” thing around 2010.

    Word barely supports old Word files. Very few tools can reproduce .doc files other than Office itself, and even Office versions aren’t all compatible.

    My approach would be to install some kind of Office on a machine and just script the hell out of opening files and saving them as docx or whatever open format Word supports these days. Word exposes a COM interface you can script against, so most programming languages and JScript or VBS can automate this process.

    If you can figure out how to scan files in a loop, this snippet may get you started:

    Set word = CreateObject("Word.Application")
    word.Visible = True
    word.Documents.Open("C:\Documents and Settings\User\Hello.doc")
    Set doc = word.ActiveDocument
    doc.SaveAs "C:\Documents and Settings\Gebruiker\Mijn documenten\export.docx", 16
    word.Quit()
    

    To do this with reasonable speed, keep one instance of word around and close the documents rather than quitting Word every time you iterate through the list.

  • absGeekNZ@lemmy.nz
    link
    fedilink
    English
    arrow-up
    0
    ·
    21 days ago

    I have office 2007 on a winxp VM, I haven’t had to use it in a few years, but it is there as a back up

      • absGeekNZ@lemmy.nz
        link
        fedilink
        English
        arrow-up
        0
        ·
        21 days ago

        Long past, but for old files especially, old .doc files it is great as a backup.

        It lives in a VM that never has access to the internet, it almost never gets started up.

  • UnbalancedFox@lemmy.ca
    link
    fedilink
    arrow-up
    0
    ·
    21 days ago

    I bought a cheap win11+office 2021 combo on the net and use a VM. Its not the easiest way but it works…

    😔