Click here to monitor SSC
SQLServerCentral is supported by Red Gate Software Ltd.
 
Log in  ::  Register  ::  Not logged in
 
 
 
        
Home       Members    Calendar    Who's On


Add to briefcase

Word Doc in SQL Server! Expand / Collapse
Author
Message
Posted Tuesday, May 14, 2013 8:55 AM
Old Hand

Old HandOld HandOld HandOld HandOld HandOld HandOld HandOld Hand

Group: General Forum Members
Last Login: Tuesday, July 1, 2014 3:34 AM
Points: 378, Visits: 886

Hi All
I wish to store a word document in SQL Server. I should be able to search on the content. e.g. Select Content from DocTable where content like '%this doc is good%'.
I want to retain basic formatting like old,underline , italics.
Is there a way to achieve this without saving the document as HTML?
Thanks


-----Table Proc Index Performance TSQL &&%$#@*(#@$%.......------------
Deep Into SQL Jungle
Post #1452650
Posted Tuesday, May 14, 2013 9:02 AM


SSChampion

SSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampion

Group: General Forum Members
Last Login: Today @ 9:20 AM
Points: 12,916, Visits: 32,080
S_Kumar_S (5/14/2013)

Hi All
I wish to store a word document in SQL Server. I should be able to search on the content. e.g. Select Content from DocTable where content like '%this doc is good%'.
I want to retain basic formatting like old,underline , italics.
Is there a way to achieve this without saving the document as HTML?
Thanks


you'll want to use Full text indexing, which has the option to scan various types of documents: you definitely don't reinvent the wheel when someone has built a race car for the same issue.

as far as formatting goes, the formatting in the document is still in place, but it's broken up into search words;
not sure what it is you are after as far as formatting goes.

http://msdn.microsoft.com/en-us/library/ms142571.aspx


Filters. Some data types require filtering before the data in a document can be full-text indexed, including data in varbinary, varbinary(max), image, or xml columns. The filter used for a given document depends on its document type. For example, different filters are used for Microsoft Word (.doc) documents, Microsoft Excel (.xls) documents, and XML (.xml) documents. Then the filter extracts chunks of text from the document, removing embedded formatting and retaining the text and, potentially, information about the position of the text. The result is a stream of textual information. For more information, see Configure and Manage Filters for Search.


Lowell

--There is no spoon, and there's no default ORDER BY in sql server either.
Actually, Common Sense is so rare, it should be considered a Superpower. --my son
Post #1452652
Posted Wednesday, May 14, 2014 1:32 AM
Grasshopper

GrasshopperGrasshopperGrasshopperGrasshopperGrasshopperGrasshopperGrasshopperGrasshopper

Group: General Forum Members
Last Login: Wednesday, May 21, 2014 3:51 AM
Points: 10, Visits: 31
S_Kumar_S (5/14/2013)

Hi All
I wish to
store a word document in SQL Server. I should be able to search on the content. e.g. Select Content from Word DocTable where content like '%this doc is good%'.
I want to retain basic formatting like old,underline , italics.
Is there a way to achieve this without saving the document as HTML?
Thanks


You want to make the search key words contain basic formatting? like this,

http://msdn.microsoft.com/en-us/library/office/aa211953%28v=office.11%29.aspx
Post #1570665
« Prev Topic | Next Topic »

Add to briefcase

Permissions Expand / Collapse