| View previous topic :: View next topic |
| Author |
Message |
jens0331 Newbie

Joined: 16 Apr 2007 Posts: 2
|
Posted: Mon Apr 16, 2007 6:33 am Post subject: Excluding words from the keywors index |
|
|
Hi,
I'm writing a bit longer text in German with a keyword index, and I
have the following problem.
The word 'Spiel' (game) should be indexed, the word 'Beispiel' (example) shouldn't. Because of Beipspiel actually contains the word 'spiel', it is indexed.
How do I exclude those words from the index, which aren't subject to be indexed.
Turning the option "Word only" off, is not acceptable. When I write e.g. 'Rollenspiel' (role game) which also contains the fragment Spiel and it should be index.
My best guess, what I might can do is: To delete the word 'Spiel' completely from the index, but use composite nouns like 'Rollenspiel' in the index.
Any ideas beyond this one? |
|
| Back to top |
|
 |
Robert Tucker Moderator


Joined: 16 Aug 2004 Posts: 3367 Location: Manchester UK
|
Posted: Mon Apr 16, 2007 7:45 am Post subject: |
|
|
You are not going to be able to use standard search functions (even regular expressions) to distinguish between Beispiel and Rollenspiel. You would need to write a program with excluded words and it might not be that simple.
If as you typed in the text you could insert a zero-width non-breaking space between the components of composite words you do want to index (see: www.oooforum.org/forum/viewtopic.phtml?t=49179 ) you might then be able to search for (and index) words containing the non-breaking space and, in the case you cite, "spiel". |
|
| Back to top |
|
 |
jens0331 Newbie

Joined: 16 Apr 2007 Posts: 2
|
Posted: Mon Apr 16, 2007 11:47 am Post subject: |
|
|
Hi,
| Robert Tucker wrote: | | You are not going to be able to use standard search functions (even regular expressions) to distinguish between Beispiel and Rollenspiel. You would need to write a program with excluded words and it might not be that simple. |
Indeed it's a bit hard subject. Out of curiosity I've started Google Desktop to see how it stores keywords. My examples aren't near to 'Spiel'. Beispiel isn't, nor Rollenspiel is.
Maybe programing something myself is the best solution.
<quote="Scotty"> I have spend whole my life, trying to figure out crazy ways of doing things</quote>
The problem with the German language is, that you can build nouns composed of other words. A very famous one is
"Donaudampfschifffartskaptitänsmütze(enhaken)" (Danube steamboat captains - caps hook, but please don't nail me on the translation . You never find those words in any dictionary, but somehow they are correct.
Such a word is 'Rollenspiel', it is composed of Rollen (role) and
Spiel (game). Beispiel isn't such a word, even if it looks alike
I have to admit that, English has a few rules, where German should follow the example,
Maybe, I have another solution. Before finally indexing the text , I do search and replace 'Beispiel' with 'Example', than index the text and replace back. (Or use synonyms, where ever I can do that.)
.oO(Talking to someone else helps to rethink the problem, and >figure out crazy ways of doing things<. That's why I write so much) |
|
| Back to top |
|
 |
|
|
You cannot post new topics in this forum You cannot reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot vote in polls in this forum
|
Powered by phpBB © 2001, 2005 phpBB Group
|