Differences

This shows you the differences between two versions of the page.

--- doc:appunti:linux:video:ripping_dvds_with_mencoder [2017/10/13 16:49] – [OCRing] niccolo
+++ doc:appunti:linux:video:ripping_dvds_with_mencoder [2020/04/21 17:05] (current) – [OCRing] niccolo
@@ Line 1: / Line 1: @@
 ====== Ripping DVDs with Mencoder ======
+:!: For a simple recipe to rip (extract) the content of a DVD using Debian 10, see **[[vobcopy]]**.
 ===== Install the necessary programs =====
@@ Line 200: / Line 201: @@
 ===== Extract Subtitles with transcode =====
+FIXME The following programs are **missing in Debian 10 Buster**: **tcextract**, **subtitle2vobsub** and **subtitle2pgm**. We are searching for some alternatives.
 DVDs have subtitles stored as images. There are some options for dealing with them:
@@ Line 264: / Line 267: @@
 <code>
-cat subtitles_stream.ps1 | subtitle2pgm -c 255,0,0,255
+cat subtitles_stream.ps1 | subtitle2pgm
 </code>
+If you want to control how grey levels are converted, try to use the **%%-c%%** option of subtitle2pgm, something like: **%%-c 255,0,0,255%%**.
 Each subtitle should now be one file named like **movie_subtitle0003.pgm**, and a **movie_subtitle.srtx** file will be created to index them and their times on-screen.
-=== Tesseract OCR ===
+=== With Tesseract OCR ===
 <code bash>
@@ Line 275: / Line 280: @@
 find . -type f -name '*.pgm' | sort | while read file; do
     echo -n "$(basename $file) "
-    tesseract -l eng -psm 4 "$file" "$file"
+    tesseract -l eng --psm 4 "$file" "$file"
 done
 </code>
-=== Gocr ===
+=== With Gocr ===
 **NOTICE**: Dont' use the following, because Gocr is not the best tool for OCR. Use **Tesseract OCR** instead.
@@ Line 291: / Line 296: @@
 It will prompt you for tons of characters that it doesn't understand, and often totally bugger them up even when you give it the correct ones (it reads part of what it showed you again as another character...)
-====  ====
+==== Make a single .srt file ====
 Now we will re-merge all these text files produced into a big subtitle file: