Qpdf view text in text editor
I have used qpdf with
qpdf --qdf --object-streams=disable orig.pdf expanded.pdf
As reference from https://unix.stackexchange.com/a/109177/306249
I can see all objects in text editor. But I cannot view the text in the pdf.
I have test with "Hello world" text in pdf. But I can't see this text after decompress.
How to view the text with objects?
add a comment |
I have used qpdf with
qpdf --qdf --object-streams=disable orig.pdf expanded.pdf
As reference from https://unix.stackexchange.com/a/109177/306249
I can see all objects in text editor. But I cannot view the text in the pdf.
I have test with "Hello world" text in pdf. But I can't see this text after decompress.
How to view the text with objects?
add a comment |
I have used qpdf with
qpdf --qdf --object-streams=disable orig.pdf expanded.pdf
As reference from https://unix.stackexchange.com/a/109177/306249
I can see all objects in text editor. But I cannot view the text in the pdf.
I have test with "Hello world" text in pdf. But I can't see this text after decompress.
How to view the text with objects?
I have used qpdf with
qpdf --qdf --object-streams=disable orig.pdf expanded.pdf
As reference from https://unix.stackexchange.com/a/109177/306249
I can see all objects in text editor. But I cannot view the text in the pdf.
I have test with "Hello world" text in pdf. But I can't see this text after decompress.
How to view the text with objects?
edited Aug 19 at 8:52
Pierre.Vriens
96651015
96651015
asked Aug 19 at 7:33
Gee Nino
142
142
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
Even after expansion of all objects, texts (strings and single characters) do not need to be represented in ASCII, they may be hex-encoded.
To find the text, proceed as follows:
In your expanded PDF, look for all keys named
/Contents
. It may look like:
/Contents 8 0 R
This tells you that the contents of the respective page is in object number 8.
Go to object number 8. This can be found by searching for the string
'8 0 obj'
In the following lines, bracketed by the lines
stream ... endstream
, if you see at the end of a line either one of...
...
TJ
,Tj
,'
or"
you'll have a text showing operator at work.
The preceding line holds the text, but it may look like:
[(H)0.0976563(e)0.0976563(l)-599.902(l)0.0976563(o)0.0976563(W)0.0976563(o)-599.902(r)0.0976563(l)0.0976563(d)0.0976563(!)]TJ
Hey, you were lucky! Can you decipher the "Hello World!" string here? The intermediate numbers are only to control the placement of the individual characters....
....and now I'll stop to teach PDF. You can read all the details in the official PDF format specification :-)
Just one more hint: if you search for
my other PDF-related answers on StackOverflow,
you may discover quite a few examples which go into more details about how to read PDF code.
add a comment |
Your Answer
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "106"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f463455%2fqpdf-view-text-in-text-editor%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
Even after expansion of all objects, texts (strings and single characters) do not need to be represented in ASCII, they may be hex-encoded.
To find the text, proceed as follows:
In your expanded PDF, look for all keys named
/Contents
. It may look like:
/Contents 8 0 R
This tells you that the contents of the respective page is in object number 8.
Go to object number 8. This can be found by searching for the string
'8 0 obj'
In the following lines, bracketed by the lines
stream ... endstream
, if you see at the end of a line either one of...
...
TJ
,Tj
,'
or"
you'll have a text showing operator at work.
The preceding line holds the text, but it may look like:
[(H)0.0976563(e)0.0976563(l)-599.902(l)0.0976563(o)0.0976563(W)0.0976563(o)-599.902(r)0.0976563(l)0.0976563(d)0.0976563(!)]TJ
Hey, you were lucky! Can you decipher the "Hello World!" string here? The intermediate numbers are only to control the placement of the individual characters....
....and now I'll stop to teach PDF. You can read all the details in the official PDF format specification :-)
Just one more hint: if you search for
my other PDF-related answers on StackOverflow,
you may discover quite a few examples which go into more details about how to read PDF code.
add a comment |
Even after expansion of all objects, texts (strings and single characters) do not need to be represented in ASCII, they may be hex-encoded.
To find the text, proceed as follows:
In your expanded PDF, look for all keys named
/Contents
. It may look like:
/Contents 8 0 R
This tells you that the contents of the respective page is in object number 8.
Go to object number 8. This can be found by searching for the string
'8 0 obj'
In the following lines, bracketed by the lines
stream ... endstream
, if you see at the end of a line either one of...
...
TJ
,Tj
,'
or"
you'll have a text showing operator at work.
The preceding line holds the text, but it may look like:
[(H)0.0976563(e)0.0976563(l)-599.902(l)0.0976563(o)0.0976563(W)0.0976563(o)-599.902(r)0.0976563(l)0.0976563(d)0.0976563(!)]TJ
Hey, you were lucky! Can you decipher the "Hello World!" string here? The intermediate numbers are only to control the placement of the individual characters....
....and now I'll stop to teach PDF. You can read all the details in the official PDF format specification :-)
Just one more hint: if you search for
my other PDF-related answers on StackOverflow,
you may discover quite a few examples which go into more details about how to read PDF code.
add a comment |
Even after expansion of all objects, texts (strings and single characters) do not need to be represented in ASCII, they may be hex-encoded.
To find the text, proceed as follows:
In your expanded PDF, look for all keys named
/Contents
. It may look like:
/Contents 8 0 R
This tells you that the contents of the respective page is in object number 8.
Go to object number 8. This can be found by searching for the string
'8 0 obj'
In the following lines, bracketed by the lines
stream ... endstream
, if you see at the end of a line either one of...
...
TJ
,Tj
,'
or"
you'll have a text showing operator at work.
The preceding line holds the text, but it may look like:
[(H)0.0976563(e)0.0976563(l)-599.902(l)0.0976563(o)0.0976563(W)0.0976563(o)-599.902(r)0.0976563(l)0.0976563(d)0.0976563(!)]TJ
Hey, you were lucky! Can you decipher the "Hello World!" string here? The intermediate numbers are only to control the placement of the individual characters....
....and now I'll stop to teach PDF. You can read all the details in the official PDF format specification :-)
Just one more hint: if you search for
my other PDF-related answers on StackOverflow,
you may discover quite a few examples which go into more details about how to read PDF code.
Even after expansion of all objects, texts (strings and single characters) do not need to be represented in ASCII, they may be hex-encoded.
To find the text, proceed as follows:
In your expanded PDF, look for all keys named
/Contents
. It may look like:
/Contents 8 0 R
This tells you that the contents of the respective page is in object number 8.
Go to object number 8. This can be found by searching for the string
'8 0 obj'
In the following lines, bracketed by the lines
stream ... endstream
, if you see at the end of a line either one of...
...
TJ
,Tj
,'
or"
you'll have a text showing operator at work.
The preceding line holds the text, but it may look like:
[(H)0.0976563(e)0.0976563(l)-599.902(l)0.0976563(o)0.0976563(W)0.0976563(o)-599.902(r)0.0976563(l)0.0976563(d)0.0976563(!)]TJ
Hey, you were lucky! Can you decipher the "Hello World!" string here? The intermediate numbers are only to control the placement of the individual characters....
....and now I'll stop to teach PDF. You can read all the details in the official PDF format specification :-)
Just one more hint: if you search for
my other PDF-related answers on StackOverflow,
you may discover quite a few examples which go into more details about how to read PDF code.
answered Dec 17 at 23:12
Kurt Pfeifle
47038
47038
add a comment |
add a comment |
Thanks for contributing an answer to Unix & Linux Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Some of your past answers have not been well-received, and you're in danger of being blocked from answering.
Please pay close attention to the following guidance:
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f463455%2fqpdf-view-text-in-text-editor%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown