Temporary Fix to Import 2025 Programs

mikquagl · December 22, 2024, 10:26pm

Description

Hi Brothers, one of the elderly in my congregation just told me that he was having some issues to run the software with 2025 programs.
After a dive deep on the EPUB for the 2024 and 2025 it was clear that there is a difference in the file structure that is preventing the software to run.
I’ve created a simple program to fix the 2025 epubs and use them in TheocBase.

Please, let me know if it’s working for you.
If you have mac and cannot run the .exe file, just drop a message with the nationality needed and I can try to create it for you.

UPDATE

I’ve just made the conversion for English, Italian, Spanish, French and Deutsch.
You can find them in the drive folder.

Download

https://drive.google.com/drive/folders/1bXJx_aTbphfc-xedObwgcRRLSfrAIg5C?usp=sharing
In the folder you will find a video that shows how to use the script and a zip file with the program

How to Use the Program

STEP 1. Download it from the drive folder
STEP 2. Extract the zip file
STEP 3. Open the extracted folder TheocBaseFix
STEP 4. Download the pub for 2025 needed
STEP 5. Move the file inside the TheocBaseFix folder
STEP 6. Drag the file above the program TheocBaseFix.exe
STEP 7. Navigate to the newly created OUTPUT folder
STEP 8. From there you can import the file as always

Code

import copy
import os
import sys
import zipfile
from bs4 import BeautifulSoup


# Function to process an XHTML file
def process_xhtml(file_path):
    print(f"Processing XHTML file: {file_path}")

    try:
        with open(file_path, 'r', encoding='utf-8') as file:
            content = file.read()

        soup = BeautifulSoup(content, 'html.parser')

        # Find the div with class bodyTxt
        body_txt = soup.find('div', class_='bodyTxt')
        if not body_txt:
            print(f"No 'div' with class 'bodyTxt' found in {file_path}")
            return

        # Find all H3 elements
        h2_tags = body_txt.find_all('h3')
        if len(h2_tags) < 2:
            print(f"Less than two H3 tags found in {file_path}")
            return

        # Process elements following the second H3
        for element in h2_tags[2].find_all_next():
            if element.name == 'div' and element.get('id', '').startswith('tt') and not element.find('h2'):
                print(f"Wrapping div with id {element.get('id')} in a new tag in {file_path}")
                new_tag = soup.new_tag('div')
                element.wrap(new_tag)

        # Save changes back to the file
        with open(file_path, 'w', encoding='utf-8') as file:
            file.write(str(soup))
        print(f"File processed and saved: {file_path}")

    except Exception as e:
        print(f"Error processing {file_path}: {e}")


# Function to extract and process EPUB files
def process_epub(epub_path):
    print(f"Processing EPUB file: {epub_path}")

    try:
        with zipfile.ZipFile(epub_path, 'r') as zip_ref:
            extract_dir = "extracted_epub"
            zip_ref.extractall(extract_dir)
        print(f"EPUB extracted to {extract_dir}")

        oebps_path = os.path.join(extract_dir, "OEBPS")

        if not os.path.exists(oebps_path):
            print("OEBPS folder not found in the EPUB structure")
            return

        # Locate all XHTML files to process
        for root, dirs, files in os.walk(oebps_path):
            for file_name in files:
                if file_name.endswith(".xhtml") and "-extracted" not in file_name:
                    file_path = os.path.join(root, file_name)
                    print(f"Found XHTML file to process: {file_path}")
                    process_xhtml(file_path)

        # Optional: Re-compress the EPUB
        output_epub = "./OUTPUT/" + os.path.basename(epub_path)
        os.makedirs(os.path.dirname(output_epub), exist_ok=True)
        with zipfile.ZipFile(output_epub, 'w') as zip_ref:
            for root, dirs, files in os.walk(extract_dir):
                for file in files:
                    file_path = os.path.join(root, file)
                    arcname = os.path.relpath(file_path, extract_dir)
                    zip_ref.write(file_path, arcname)
        print(f"Processed EPUB saved as {output_epub}")

    except Exception as e:
        print(f"Error processing EPUB {epub_path}: {e}")


# Main function
def main():
    # Ensure a file path is passed as an argument
    if len(sys.argv) != 2:
        print("Usage: Drag and drop a file onto this script to process it.")
        return

    # Get the file path from the arguments
    file_path = sys.argv[1]
    print(f"Starting processing for file: {file_path}")
    process_epub(file_path)


if __name__ == "__main__":
    main()

exe created with pyinstaller.

AKuser · December 23, 2024, 11:20pm

Thanks you downloaded and installed the English pubs!

Tim · December 24, 2024, 7:29am

Спасибо дорогие за вашу работу! На русском языке тоже все работает

HenryS · December 24, 2024, 3:42pm

Great job. In German it works perfect.
Thank you very much for your assitance.

Charllys · December 24, 2024, 9:07pm

It worked for programs in Portuguese Portugal and Portuguese Brazil. Thank you very much.

Gilcson · December 25, 2024, 3:29am

Depois dessa atualização, não estou mais conseguindo importar as programações. Por que está faltando dlls no meu computador. Já tentei de tudo.

mikquagl · December 25, 2024, 2:07pm

Which DLL are missing? Do you have any specific error?
If the problem is just that program are imported, but are not complete, you can try to use the above procedure

Gilcson · December 25, 2024, 2:48pm

api-ms-win-crt-runtime-l1-1-0-dll.
A princípio esse é que está faltando.

gntdfor · December 25, 2024, 8:10pm

anche in ITALIANO tutto Ok. Grazie di cuore.

EdiFreitas · December 25, 2024, 10:06pm

Obrigado irmão, deu muito certo…muito obrigado.

Guidoief · December 25, 2024, 11:08pm

Hey, i cant open the .exe file, could you please help me with the dutch nationality? Thanks alot!

Tim · December 26, 2024, 3:45am

Добрый день. К сожалению после того как была назначены задания все слетело(((
Видимо для русского языка не подходит.

Flavio · December 26, 2024, 9:44am

Grazie! Ho seguito le istruzioni che hai postato, e, guardando il video incluso, sono riuscito ad installare correttamente i programmi! Veramente un lavoro egregio! Grazie di cuore!

mikquagl · December 26, 2024, 10:21am

Hey Gilcson, you probably just need to install Visual C++ Redistributable for Visual Studio 2015 https://www.microsoft.com/en-in/download/details.aspx?id=48145

mikquagl · December 26, 2024, 10:26am

Hey Guidoief, here you can find Dutch pub converted → https://drive.google.com/drive/folders/14Tjp53_ePAHv3gFDFdTwiOghkJVQnXUr?usp=drive_link

mikquagl · December 26, 2024, 10:32am

Hey @Tim , you can try with the followings.https://drive.google.com/drive/folders/1SqLmLz20uKwLocSOms03fJOcM5z4QuGJ?usp=drive_link
But, if I got it right, you had problems only after you’ve assigned each task. Can I ask you what happened?

Guidoief · December 26, 2024, 10:07pm

Thanks alot! You’re a lifesaver.

Tim · December 27, 2024, 7:59am

Добрый день! Случилось вот что. После назначения начали повторяться пункты например 2 жемчужины

Tim · December 27, 2024, 8:23am

Спасибо большое тебе друг! Но твои тетради не импортируются. Пишет что расписание недоступно.

goeldi · December 28, 2024, 11:59am

The Linux way

Thank you for this fix! I tested it on Linux (Ubuntu 24.04) and it works.

Howto:

copy the content of the script into the file TheocBaseFix.py
add a shebang as first line:
#!/usr/bin/python3
make the script executable:
chmod 755 TheocBaseFix.py
if pip is not already installed, do it:
sudo apt install python3-pip
This is for Debian family like Ubuntu. For other linux or bsd variants, replace apt with:
- MacOS brew, Arch pacman, RedHat-a-like dnf or yum, openSuSE zypper, Gentoo emerge, Alpine apk add
install the module BeautifulSoup:
- pip install beautifulsoup4
download the epub file from jw.org to the same directory
(in this example I tried with German March 2025)
now the script TheocBaseFix.py and the epub file (e.g. mwb_X_202503.epub are in the same directory)
execute in console:
./TheocBaseFix.py mwb_X_202503.epub
a subdirectory named OUTPUT is created with the fixed epub file as content