Python mailbox on large mbox datasets
I am using the Python mailbox package to extract messages and clean the data. I am running into a problem with big databases: I can call the constructor on a sample file, but when I try to print the messages the program hangs. I assume this is because the file I am trying to read is over 7 GB. How can I deal with this problem?
Consider splitting the mailbox manually. The format is easy to process (as long as you only need read-only access) by reading it line by line, and you can use the existing classes for the actual parsing of individual messages.
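A minimal sketch of that approach, assuming the file is a conventional mbox in which every message starts with a line beginning with "From " and body lines that begin with "From " have been escaped as ">From ". The function name iter_mbox_messages is made up for illustration, and the standard email parser stands in for "the existing classes". Only one message is held in memory at a time, so nothing has to index the whole 7 GB file first.

```python
import email
from email import policy


def iter_mbox_messages(path):
    """Yield parsed messages from an mbox file one at a time.

    Assumption: a line starting with b"From " always begins a new message
    (body lines of that shape are expected to be escaped as ">From ").
    """
    lines = []        # raw lines of the message currently being collected
    started = False   # becomes True once the first separator line is seen
    with open(path, "rb") as fh:
        for line in fh:  # streams the file; never reads it whole
            if line.startswith(b"From "):
                if started and lines:
                    yield email.message_from_bytes(
                        b"".join(lines), policy=policy.default
                    )
                lines = []
                started = True
                continue  # the separator line itself is not part of the message
            if started:
                lines.append(line)
    # Emit the final message, which has no separator after it.
    if started and lines:
        yield email.message_from_bytes(b"".join(lines), policy=policy.default)
```

To use it, loop over iter_mbox_messages(path_to_your_mbox) and read fields such as msg["Subject"] inside the loop. Because it is a generator, printing or cleaning each message can start immediately instead of waiting for the whole file to be scanned.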
python mbox