Skip to content

编辑概览

PDF Oxide 提供两个级别的 API 来编辑现有 PDF:高级 Pdf 类(推荐)和底层 DocumentEditor。两者都允许你打开 PDF、修改内容和元数据、跟踪变更并保存结果。

打开 PDF 进行编辑

Python

from pdf_oxide import PdfDocument

doc = PdfDocument("input.pdf")

编辑器在首次修改时延迟初始化。你可以立即开始读取,编辑器在你调用任何修改方法(如 set_title()page())时激活。

WASM

import { WasmPdfDocument } from "pdf-oxide-wasm";

const bytes = new Uint8Array(/* file bytes */);
const doc = new WasmPdfDocument(bytes);

Rust

使用统一的 Pdf API:

use pdf_oxide::api::Pdf;

let mut doc = Pdf::open("input.pdf")?;

或直接使用 DocumentEditor 进行底层控制:

use pdf_oxide::editor::DocumentEditor;

let mut editor = DocumentEditor::open("input.pdf")?;

检查修改

在保存之前,你可以检查是否有任何更改:

Python

doc = PdfDocument("input.pdf")
print(doc.is_modified)  # False -- no changes yet

doc.set_title("Updated Title")
print(doc.is_modified)  # True

Rust

let mut doc = Pdf::open("input.pdf")?;
assert!(!doc.is_modified());

doc.editor().unwrap().set_title("Updated Title");
assert!(doc.is_modified());

保存

Python

doc = PdfDocument("input.pdf")
doc.set_title("New Title")
doc.save("output.pdf")

WASM

import { WasmPdfDocument } from "pdf-oxide-wasm";

const bytes = new Uint8Array(/* file bytes */);
const doc = new WasmPdfDocument(bytes);
doc.setTitle("New Title");
const output = doc.save();
doc.free();

Rust

let mut doc = Pdf::open("input.pdf")?;
doc.editor().unwrap().set_title("New Title");
doc.save("output.pdf")?;

// Or save to a new path
doc.save_as("copy.pdf")?;

save() 方法默认执行 PDF 的完整重写。对于高级保存选项(增量更新、加密),参见加密与安全

文档元数据

读取和写入标准 PDF 元数据字段:标题、作者、主题和关键词。

Python

from pdf_oxide import PdfDocument

doc = PdfDocument("input.pdf")

# Set metadata
doc.set_title("Quarterly Report")
doc.set_author("Jane Smith")
doc.set_subject("Q4 2025 Financial Results")
doc.set_keywords("finance, quarterly, 2025")

doc.save("output.pdf")

WASM

import { WasmPdfDocument } from "pdf-oxide-wasm";

const bytes = new Uint8Array(/* file bytes */);
const doc = new WasmPdfDocument(bytes);

// Set metadata
doc.setTitle("Quarterly Report");
doc.setAuthor("Jane Smith");
doc.setSubject("Q4 2025 Financial Results");
doc.setKeywords("finance, quarterly, 2025");

const output = doc.save();
doc.free();

Rust

use pdf_oxide::editor::DocumentEditor;

let mut editor = DocumentEditor::open("input.pdf")?;

// Read metadata
if let Some(title) = editor.title()? {
    println!("Current title: {}", title);
}
if let Some(author) = editor.author()? {
    println!("Current author: {}", author);
}
if let Some(subject) = editor.subject()? {
    println!("Current subject: {}", subject);
}
if let Some(keywords) = editor.keywords()? {
    println!("Current keywords: {}", keywords);
}

// Set metadata
editor.set_title("Quarterly Report");
editor.set_author("Jane Smith");
editor.set_subject("Q4 2025 Financial Results");
editor.set_keywords("finance, quarterly, 2025");

editor.save("output.pdf")?;

文档信息

源路径和版本

use pdf_oxide::editor::DocumentEditor;

let editor = DocumentEditor::open("input.pdf")?;

// Path to the original file
println!("Source: {}", editor.source_path());

// PDF version as (major, minor)
let (major, minor) = editor.version();
println!("PDF version: {}.{}", major, minor);

// Number of pages
println!("Pages: {}", editor.current_page_count());

完整 API 参考

DocumentEditor

方法 返回值 描述
open(path) Result<DocumentEditor> 打开 PDF 进行编辑
is_modified() bool 检查是否有任何更改
source_path() &str 源 PDF 的路径
source() &PdfDocument 对源文档的只读访问
version() (u8, u8) PDF version (major, minor)
current_page_count() usize 文档中的页数
title() Result<Option<String>> 获取文档标题
set_title(title) () 设置文档标题
author() Result<Option<String>> 获取文档作者
set_author(author) () 设置文档作者
subject() Result<Option<String>> 获取文档主题
set_subject(subject) () 设置文档主题
keywords() Result<Option<String>> 获取文档关键词
set_keywords(keywords) () 设置文档关键词
save(path) Result<()> 完整重写保存
save_with_options(path, options) Result<()> 使用自定义选项保存

Pdf(统一 API)

方法 返回值 描述
Pdf::open(path) Result<Pdf> 打开 PDF 进行编辑
Pdf::open_editor(path) Result<DocumentEditor> Open directly as DocumentEditor
is_modified() bool 检查是否有更改
save(path) Result<()> 保存文档
save_as(path) Result<()> 保存到新路径
page(index) Result<PdfPage> 获取用于 DOM 编辑的页面
save_page(page) Result<()> 保存修改后的页面
editor() Option<&mut DocumentEditor> 访问底层编辑器

EditableDocument trait

EditableDocument trait 定义了核心编辑契约:

pub trait EditableDocument {
    fn get_info(&mut self) -> Result<DocumentInfo>;
    fn set_info(&mut self, info: DocumentInfo) -> Result<()>;
    fn page_count(&mut self) -> Result<usize>;
    fn get_page_info(&mut self, index: usize) -> Result<PageInfo>;
    fn remove_page(&mut self, index: usize) -> Result<()>;
    fn move_page(&mut self, from: usize, to: usize) -> Result<()>;
    fn duplicate_page(&mut self, index: usize) -> Result<usize>;
    fn save(&mut self, path: impl AsRef<Path>) -> Result<()>;
    fn save_with_options(&mut self, path: impl AsRef<Path>, options: SaveOptions) -> Result<()>;
}

完整编辑工作流

此示例演示了完整的编辑会话:打开、检查、修改元数据、编辑内容和保存。

Python

from pdf_oxide import PdfDocument

# Open the document
doc = PdfDocument("report.pdf")
print(f"Pages: {doc.page_count()}")

# Update metadata
doc.set_title("Annual Report 2025")
doc.set_author("Finance Team")

# Edit text on page 0
page = doc.page(0)
for text in page.find_text_containing("DRAFT"):
    page.set_text(text.id, "FINAL")
doc.save_page(page)

# Save
doc.save("report-final.pdf")

Rust

use pdf_oxide::api::Pdf;

let mut doc = Pdf::open("report.pdf")?;
println!("Pages: {}", doc.page_count()?);

// Update metadata
{
    let editor = doc.editor().unwrap();
    editor.set_title("Annual Report 2025");
    editor.set_author("Finance Team");
}

// Edit text on page 0
let mut page = doc.page(0)?;
let drafts = page.find_text_containing("DRAFT");
for t in &drafts {
    page.set_text(t.id(), "FINAL")?;
}
doc.save_page(page)?;

// Save
doc.save("report-final.pdf")?;

相关页面