天天看點

java中操作pdf【pdfbox】【itext】

PDF轉PNG圖檔【pdfbox】

/***
     * PDF檔案轉PNG圖檔,全部頁數
     * @param PdfFilePath pdf完整路徑
     * @param imgFilePath 圖檔存放的檔案夾
     * @param dpi dpi越大轉換後越清晰,相對轉換速度越慢
     * @return
     */
    public void pdf2Image(String PdfFilePath, String dstImgFolder, int dpi) {
        File file = new File(PdfFilePath);
        PDDocument pdDocument;
        try {
            String imgPDFPath = file.getParent();
            int dot = file.getName().lastIndexOf(\'.\');
            String imagePDFName = file.getName().substring(0, dot); // 擷取圖檔檔案名
            String imgFolderPath = null;
            if (dstImgFolder.equals("")) {
                imgFolderPath = imgPDFPath + File.separator + imagePDFName;// 擷取圖檔存放的檔案夾路徑
            } else {
                imgFolderPath = dstImgFolder + File.separator + imagePDFName;
            }
            if (createDirectory(imgFolderPath)) {
                pdDocument = PDDocument.load(file);
                PDFRenderer renderer = new PDFRenderer(pdDocument);
                /* dpi越大轉換後越清晰,相對轉換速度越慢 */
                //下面兩行代碼擷取pdf頁數用到了itext,也可以用pdfbox,具體方法後面給出
                PdfReader reader = new PdfReader(PdfFilePath);
                int pages = reader.getNumberOfPages();
                StringBuffer imgFilePath = null;
                for (int i = 0; i < pages; i++) {
                    String imgFilePathPrefix = imgFolderPath + File.separator + imagePDFName;
                    imgFilePath = new StringBuffer();
                    imgFilePath.append(imgFilePathPrefix);
                    imgFilePath.append("_");
                    imgFilePath.append(String.valueOf(i + 1));
                    imgFilePath.append(".png");
                    File dstFile = new File(imgFilePath.toString());
                    BufferedImage image = renderer.renderImageWithDPI(i, dpi);
                    ImageIO.write(image, "png", dstFile);
                }
                System.out.println("PDF文檔轉PNG圖檔成功!");
            } else {
                System.out.println("PDF文檔轉PNG圖檔失敗:" + "建立" + imgFolderPath + "失敗");
            }
        } catch (IOException e) {
            e.printStackTrace();
        }

           

下面兩個方法都可以擷取pdf的頁數【pdfbox】

PDDocument doc = PDDocument.load("e://aa.pdf");

System.out.println(doc.getNumberOfPages());

List<?> pages = doc.getDocumentCatalog().getAllPages();

System.out.println(pages.size());

Apache PDFBox 相關,參考:http://blog.csdn.net/chszs/article/details/9026439

Itext相關,參考http://api.itextpdf.com/pdfa/ ; http://www.open-open.com/open25134.htm