Issue Details (XML | Word | Printable)

Key: NXP-2166
Type: Bug Bug
Status: Resolved Resolved
Resolution: Fixed
Priority: Minor Minor
Assignee: Thierry Delprat
Reporter: Olivier Grisel
Votes: 0
Watchers: 1
Operations

If you were logged in you would be able to see more operations.
Google issue summary
Nuxeo Enterprise Platform

transform tries to extract word file fulltext for empty blobs

Created: 04/03/08 20:14   Updated: 06/03/08 01:19
Component/s: Search / Query, Transforms
Affects Version/s: 5.1.3.2
Fix Version/s: 5.1.4, 5.2 M1

Time Tracking:
Original Estimate: 4 hours
Original Estimate - 4 hours
Remaining Estimate: 4 hours
Remaining Estimate - 4 hours
Time Spent: Not Specified
Remaining Estimate - 4 hours

Deliverable: Code + unit test
Resolution Date: 06/03/08 01:19
Require Callback: No
Participants: Olivier Grisel and Thierry Delprat
Tags:


 Description  « Hide
At any document modification for a File with an empty file:content field, one gets the following error in the logs (see first comment).

 All   Comments   Work Log   Change History      Sort Order: Ascending order - Click to sort in descending order
Olivier Grisel added a comment - 04/03/08 20:15 - edited
19:12:40,244 ERROR [AbstractTransformer] An error occured while trying to perform a transformation chain.
java.io.IOException: Unable to read entire header; 6 bytes read; expected 512 bytes
        at org.apache.poi.poifs.storage.HeaderBlockReader.<init>(HeaderBlockReader.java:83)
        at org.apache.poi.poifs.filesystem.POIFSFileSystem.<init>(POIFSFileSystem.java:90)
        at org.apache.poi.hwpf.HWPFDocument.verifyAndBuildPOIFS(HWPFDocument.java:120)
        at org.apache.poi.hwpf.extractor.WordExtractor.<init>(WordExtractor.java:49)
        at org.nuxeo.ecm.platform.transform.plugin.poi.WordToTextPlugin.transform(WordToTextPlugin.java:70)
        at org.nuxeo.ecm.platform.transform.transformer.AbstractTransformer.transform(AbstractTransformer.java:174)
        at org.nuxeo.ecm.platform.transform.service.TransformService.transform(TransformService.java:205)
        at org.nuxeo.ecm.platform.transform.service.TransformService.transform(TransformService.java:222)
        at org.nuxeo.ecm.core.search.blobs.NXTransformBlobExtractor.extract(NXTransformBlobExtractor.java:85)
        at org.nuxeo.ecm.core.search.api.backend.indexing.resources.factory.ResolvedResourcesFactory.blobToText(ResolvedResourcesFactory.java:273)
        at org.nuxeo.ecm.core.search.api.backend.indexing.resources.factory.ResolvedResourcesFactory.convertForFullText(ResolvedResourcesFactory.java:237)
        at org.nuxeo.ecm.core.search.api.backend.indexing.resources.factory.ResolvedResourcesFactory.extractForFullText(ResolvedResourcesFactory.java:171)
        at org.nuxeo.ecm.core.search.api.backend.indexing.resources.factory.ResolvedResourcesFactory.computeFulltext(ResolvedResourcesFactory.java:121)
        at org.nuxeo.ecm.core.search.api.backend.indexing.resources.factory.ResolvedResourcesFactory.computeAggregatedResolvedResourcesFrom(ResolvedResourcesFactory.java:299)
        at org.nuxeo.ecm.core.search.service.SearchServiceImpl.index(SearchServiceImpl.java:266)
        at org.nuxeo.ecm.core.search.threading.IndexingTask.run(IndexingTask.java:73)
        at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:885)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:907)
        at java.lang.Thread.run(Thread.java:619)
19:12:40,244 INFO [AbstractTransformer] Global transformation chain terminated for transformer name=word2text_poi Duration :1ms