Comparing doc or pdf Files from Using SQL Server

  • hi friends

    i have very interesting qeustion

    can we comparing two another files with sql server

    for example

    in varbinary column have a pdf format file

    in inserting new data in this table first i need compare

    files with another files in my table

    have this file in this table or not

    and

    can we compare files finding text from inserting file data

    thanks

  • natigsqlserver (4/16/2016)


    hi friends

    i have very interesting qeustion

    can we comparing two another files with sql server

    for example

    in varbinary column have a pdf format file

    in inserting new data in this table first i need compare

    files with another files in my table

    have this file in this table or not

    and

    can we compare files finding text from inserting file data

    thanks

    Not enough information. Please post the CREATE TABLE statement for this table. I'll also tell you that if you don't have the file name and the date of the file stored in this table, it will take a VERY long time to compare the varbinary column if that's what you're thinking here.

    Last but not least, what are you using now to determine if a file needs to be loaded and what are you using to actually do the file load?

    --Jeff Moden


    RBAR is pronounced "ree-bar" and is a "Modenism" for Row-By-Agonizing-Row.
    First step towards the paradigm shift of writing Set Based code:
    ________Stop thinking about what you want to do to a ROW... think, instead, of what you want to do to a COLUMN.

    Change is inevitable... Change for the better is not.


    Helpful Links:
    How to post code problems
    How to Post Performance Problems
    Create a Tally Function (fnTally)

  • Do you need all the text to match, meaning are you only concerned about the text in a PDF or does the formatting matter?

    I'm asking because they're different. I could reformat a PDF slightly, not changing any of the information (pictures/words) and a binary comparison would be different.

    SQL Server does have the ability to read the words from PDFs using the iFilter interface and full text search. However, I'm not sure this is what you want, or if this would work. I haven't tried an exact comparison of large text in an iFTS situation. That's interesting. I'll see if I can find a way to try this.

    Note: Performance would likely be poor, so if you were comparing more than two items, meaning a search, this would probably not work well.

Viewing 3 posts - 1 through 2 (of 2 total)

You must be logged in to reply to this topic. Login to reply