Jacob's TN Searchy Thingy

Make suggestions and report problems.
JacobH
Quinta do Vesuvio 1994
Posts: 3300
Joined: 15:37 Sat 03 May 2008
Location: London, UK

Post by JacobH »

Admin2 Note: This thread split from here.

I don’t know if this would be helpful, or not, but I’ve been working on a similar script that I have been using to search through the TN on :ftlop:. I’ve got a copy up here: http://www.jacob-head.com/ftlop/ . It’s password protected, because I still am not quite sure whether Roy and the rest of the FTLOP team are happy with me publicising it, but feel free to try it. The username is: theportforum and the password: NovalNacional.

It can be updated automatically (though at the moment this is inactive so I don’t overuse the :ftlop: servers), which may be an advantage. Its main disadvantages are that it currently doesn’t have names and dates for each note, and it just gives the topic subject rather than refining it.

I’m not sure whether there is much need for something like that. I’ve occasionally found it useful to use it to search for (say) all the VPs from one year, but perhaps for most uses the basic list will suffice!
DRT
Fonseca 1966
Posts: 15786
Joined: 22:51 Wed 20 Jun 2007
Location: Chesterfield, UK

Post by DRT »

Jacob,

Is this scanning the :ftlop: users' TNs or Roy's own TNs? If it's the former then it seems it is not really much different from a bot scanning for keywords on any public site. If it's the latter I would imagine Roy will have a major issue with it, especially if you have subscribed to :ftlop: to gain access to the data.

Derek
"The first duty of Port is to be red"
Ernest H. Cockburn

Post by DRT »

I just answered my own question by trying it and finding one of my own TNs!

Have you contacted Roy or Stewart about it? If not I would suggest you do to see what they think.

Derek
KillerB
Taylor Quinta de Vargellas 1987
Posts: 2425
Joined: 21:09 Wed 20 Jun 2007
Location: Sky Blue City, England

Post by KillerB »

I like that - it's different from what we do but useful nonetheless.
Port is basically a red drink

Post by JacobH »

I did, but it was about a month ago, at the same time as they were relaunching the site, so I didn’t get a chance to take it anywhere.

Although, as you say, it affects their site no more than Google, I think it would be a bit rude to advertise it on there without their permission!
Conky
Fonseca 1980
Posts: 1770
Joined: 22:51 Wed 20 Jun 2007

Post by Conky »

A search facility does seem good. To those with technical know how, is it clearly better than the basic search on the green toolbar?

Post by DRT »

Jacob,

Would it take lots of effort to create a version of your index/search facility that points at the TNs on :tpf:?

Obviously it would be good for The :tpf: Collective to agree to allow that to happen but I would be interested to know how easy it would be to make possible.

Derek

Post by JacobH »

No, it shouldn’t be too difficult…To produce a manually-updated version on my site (like the :ftlop: one) would, touch wood, only take about 10 minutes…Adding names and dates should, I hope, also be quite straight-forward.

Getting it to automatically update would be a bit more complex as it would have to be hosted on the :tpf: server, reading the threads off the forum database. I think that would be a Sunday-afternoon project!

Post by DRT »

JacobH wrote:No, it shouldn’t be too difficult…To produce a manually-updated version on my site (like the :ftlop: one) would, touch wood, only take about 10 minutes…Adding names and dates should, I hope, also be quite straight-forward.

Getting it to automatically update would be a bit more complex as it would have to be hosted on the :tpf: server, reading the threads off the forum database. I think that would be a Sunday-afternoon project!
Interesting! Perhaps others could chip in with thoughts?

Derek

Post by KillerB »

If you show me how it's done then we will give it a go. I can write SQL queries that will achieve the same thing so see no reason not to achieve it with whatever else we've got.

Post by JacobH »

I made the necessary changes and put a version for :tpf: on my site at http://www.jacob-head.com/tpf. For the record, the necessary changes did only take 10 minutes, though working out that the same 10 bottles were shown 15 times each not because of a bug in the code but because I’d downloaded the same page 15 times took considerably longer… :opps:

It’s a bit rough-and-ready but hopefully shows what might be possible.

KillerB, the only non-trivial bit of code is this one which takes an arbitrary string and then returns an array containing a guess as to the shipper, type of port and vintage referred to in that string.

The index is generated in three stages, using XML. First we get the raw data, consisting of topic names and urls, and format them into an XML file as follows:

Code: Select all

<forum>
  <topic>
    <name>Cruz Ruby</name>
    <url>http://www.xyz.com/xzy.php</url>
  </topic>
</forum>
For these mock-ups, I’ve been doing this by grabbing the forum index source code with wget and parsing it through some sed scripts.
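The same stage-one step could in principle be done in PHP rather than with wget and sed. The sketch below is only an illustration: it assumes that topic links in the index carry a `topictitle` class, which would need checking against the real forum markup, and `index_html_to_xml` is a made-up name.

```php
<?php
// Assumption: topic links look like <a class="topictitle" href="...">Title</a>.
// Adjust the XPath expression to whatever the real index page uses.
function index_html_to_xml(string $html): string
{
    $doc = new DOMDocument();
    // Real-world HTML is rarely valid, so suppress libxml parse warnings.
    @$doc->loadHTML($html);
    $links = (new DOMXPath($doc))->query('//a[@class="topictitle"]');

    $xml = new SimpleXMLElement('<forum/>');
    foreach ($links as $link) {
        $topic = $xml->addChild('topic');
        // addChild() does not escape "&", so escape the values first.
        $topic->addChild('name', htmlspecialchars(trim($link->textContent)));
        $topic->addChild('url', htmlspecialchars($link->getAttribute('href')));
    }
    return $xml->asXML();
}
```

Given a saved copy of the index page, `index_html_to_xml(file_get_contents('index.html'))` would return XML in the stage-one shape; the file name here is hypothetical.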

Stage two is this set of code, which loops through the XML and produces:

Code: Select all

<forum>
  <topic>
    <name>Cruz Ruby</name>
    <type>Ruby</type>
    <year>nv</year>
    <url>http://www.xyz.com/xzy.php</url>
  </topic>
</forum>
Finally, stage three is the index.php (source code) which just takes the XML and displays/manipulates it.

With database access, stage one would be eliminated and I could incorporate stage 2 into the index.php file in a slightly more sophisticated way (so as to make it auto-update).

Alternatively, perhaps there might be some use for the basic portid function in generating the current static list, so you don’t have to do any manual sorting?
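To illustrate the kind of guess a portid-style function makes, here is a minimal sketch. It is not the actual portid code: the name `guess_port`, the tiny keyword tables, the return order (year, producer, type) and the rule that a dated wine with no style keyword counts as Vintage are all assumptions made for the example.

```php
<?php
// Hypothetical sketch of a portid()-style guesser. The keyword tables are
// small illustrative samples, not the real script's data.
function guess_port(string $title): array
{
    // Keyword (lower case) => canonical shipper name.
    $shippers = [
        'vesuvio' => 'Quinta do Vesuvio',
        'fonseca' => 'Fonseca',
        'cruz'    => 'Cruz',
    ];
    // Keyword => style; checked in order, so more specific terms go first.
    $types = [
        'lbv'      => 'Late Bottled Vintage',
        'colheita' => 'Colheita',
        'tawny'    => 'Tawny',
        'ruby'     => 'Ruby',
    ];

    $lower = strtolower($title);

    // A four-digit year from 1800 to 2099 is taken as the vintage.
    $year = preg_match('/\b(18|19|20)\d{2}\b/', $title, $m) ? $m[0] : 'nv';

    $producer = 'unknown';
    foreach ($shippers as $keyword => $name) {
        if (strpos($lower, $keyword) !== false) {
            $producer = $name;
            break;
        }
    }

    // Assumption: a dated wine with no style keyword is a vintage port.
    $type = ($year === 'nv') ? 'unknown' : 'Vintage';
    foreach ($types as $keyword => $name) {
        if (strpos($lower, $keyword) !== false) {
            $type = $name;
            break;
        }
    }

    return [$year, $producer, $type];
}
```

Because only the keyword is matched, a title such as "Quinta da Vesuvio 1994" still resolves to "Quinta do Vesuvio".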

Post by KillerB »

Thank you Jacob. We will look at ways of getting this directly on the TPF site with access to the Database. At the moment, I need sleep.

Post by JacobH »

I've finally had a chance to patch together some pseudo-php which would be how I would automate the script with database access if such a system might be helpful.

The passages marked **SQL** would, of course, need to be converted into real SQL calls, but as that's very much dependent on your set-up, it seemed easiest to leave them in pseudo-form. It uses my PortID function to do the hard work. I'm just about finishing a version 1.01 which will split crusted, garrafeira and white out of "ruby &c." and do some sub-types (e.g. identify if it is a 10/20/30/40 year-old tawny; ruby or ruby reserve; which sort of white and have a guess at the quinta for sqvp).

Code: Select all

include('./portid.inc');

## We only need to update if there have been new topics.
## How many topics are there now?

**SQL**
$currenttopics = SELECT forum_topics FROM phpbb_forums WHERE forum_id = 2;

## How many topics were there last time? (0 if the file does not exist yet)

$oldtopics = file_exists('numberoftopics.txt') ? (int) file_get_contents('numberoftopics.txt') : 0;

## If there are now more topics than last time, then we rebuild the database
if ($currenttopics > $oldtopics) {

    ## Create a new XML document with a <forum> root element.
    ## SimpleXML is used because addChild() and asXML() below are
    ## SimpleXMLElement methods, not DOMDocument ones.
    $xml = new SimpleXMLElement('<forum/>');

    ## Build an array of topic numbers from the TN forum

    $arrayoftopics = array();

    **SQL**
    $arrayoftopics = SELECT topic_id FROM phpbb_topics WHERE forum_id = 2;

    ## Loop through the array

    foreach ($arrayoftopics as $currenttopic) {

        $newtopic = $xml->addChild('topic');

        **SQL**
        $topicname = SELECT topic_title FROM phpbb_topics WHERE topic_id = $currenttopic;

        $newtopic->addChild('name', $topicname);

        $newtopic->addChild('url', 'http://www.theportforum.com/viewtopic.php?t=' . $currenttopic);

        ## portid() returns array(year, producer, type)
        $portid = portid($topicname);

        $newtopic->addChild('year', $portid[0]);
        $newtopic->addChild('producer', $portid[1]);
        $newtopic->addChild('type', $portid[2]);

        **SQL**
        $topictime = SELECT topic_time FROM phpbb_topics WHERE topic_id = $currenttopic;

        $newtopic->addChild('time', $topictime);

        **SQL**
        $topic_poster = SELECT topic_poster FROM phpbb_topics WHERE topic_id = $currenttopic;

        $newtopic->addChild('author', $topic_poster);

    }

    file_put_contents('topics.xml', $xml->asXML());

    ## Remember the new topic count so the next run can skip the rebuild
    file_put_contents('numberoftopics.txt', $currenttopics);
}

***here continues the index.php file from previous post to display the database***
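For reference, here is one way the **SQL** placeholders could be turned into real calls. It is only a sketch under assumptions: it uses PDO (which gives the same interface for MySQL and PostgreSQL), takes the table and column names from the pseudo-code above without checking them against the live schema, and fetches all the columns in one query rather than one query per field. Both function names are invented for the example.

```php
<?php
// Sketch only: assumes PDO and the phpBB table/column names used in the
// pseudo-code above. The forum id is passed in by the caller.
function fetch_tn_topics(PDO $db, int $forumId): array
{
    $stmt = $db->prepare(
        'SELECT topic_id, topic_title, topic_time, topic_poster
           FROM phpbb_topics
          WHERE forum_id = :forum
          ORDER BY topic_id'
    );
    $stmt->execute([':forum' => $forumId]);
    return $stmt->fetchAll(PDO::FETCH_ASSOC);
}

// Turns the fetched rows into the <forum><topic>...</topic></forum> document.
function topics_to_xml(array $rows): string
{
    $xml = new SimpleXMLElement('<forum/>');
    foreach ($rows as $row) {
        $topic = $xml->addChild('topic');
        // addChild() does not escape "&", so escape the title first.
        $topic->addChild('name', htmlspecialchars($row['topic_title']));
        $topic->addChild('url', 'http://www.theportforum.com/viewtopic.php?t=' . $row['topic_id']);
        $topic->addChild('time', (string) $row['topic_time']);
        $topic->addChild('author', (string) $row['topic_poster']);
    }
    return $xml->asXML();
}
```

With an open connection in `$db`, `topics_to_xml(fetch_tn_topics($db, 2))` would give the text to write to topics.xml; the year, producer and type elements would still come from portid().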
benread
Niepoort 1977
Posts: 1577
Joined: 20:36 Thu 17 Apr 2008
Location: Reigate, Surrey

Post by benread »

Jacob,

Would this search all tasting notes or just those already indexed in the "A-Z"?
Ben
-------
Vintage 1970 and now proud owner of my first ever 'half-century'!

Post by KillerB »

Jacob,

The SQL is pretty accurate. I do SQL, I don't do php, Treacle does a bit of php but doesn't do SQL :roll:

Let me know what you need to implement it and we will see what we can get done,

Cheers,

Alex

Post by JacobH »

Ben, yes, the idea is that it will automatically index the posts as they are posted. There are a few difficulties in doing it automatically but thankfully, as the number of Port shippers is quite low (120 or so), it’s not too hard to put in individual exceptions, even if we do have more varieties of wine than other regions.

Alex, that’s a shame, though completely understandable as getting php and SQL to co-operate nicely is a complete pain! I can do a bit of php and a bit of SQL at the terminal, but hate doing them together.

What might be useful to know is what database you are running as the php commands are completely different for PostgreSQL and MySQL. It would also be helpful to know if those SQL calls will work on the set-up here (e.g. with table names etc.) or if I’ve made any stupid mistakes with them (it’s a while since I’ve SQLed in anger).

I’ll try to finish off the new version of PortID and sort out the index page so it’s a bit more efficient. It’ll then just be a case of stringing the lot together, uploading it and seeing if it works :D

Post by DRT »

This thread has taken on a new meaning and I would suggest that "Jacob's TN searchy thing" deserves its own thread so that this one can return to being a prompt to KillerB and Jdaw to get moving on updating the TN Index.

Any objections to splitting from the point where Jacob first mentioned his searchy thing?

Derek

Post by JacobH »

That makes sense to me. Fire away!

Post by DRT »

JacobH wrote:That makes sense to me. Fire away!
Done. :wink:

Post by JacobH »

Although the logs suggest no-one was using it, just to let people know, I have deleted the user account for the :FTLOP: notes, as I have been informed that my posting it here was, unfortunately, causing a certain amount of annoyance at :FTLOP:.

Post by Conky »

JacobH wrote:Although the logs suggest no-one was using it, just to let people know, I have deleted the user account for the :FTLOP: notes, as I have been informed that my posting it here was, unfortunately, causing a certain amount of annoyance at :FTLOP:.
It does... :roll:

But don't worry, 99% of us are all friends. :D
Roy Hersh
Niepoort LBV
Posts: 283
Joined: 20:55 Mon 31 Dec 2007

Post by Roy Hersh »

Thank you Jacob! No harm, no foul, as it was more of an "ethics" issue for me, not one with TPF. But that is firmly behind us now.

As to the other 1%, Alan alludes to ... he may actually someday fill that void, once we meet in person and I am assured his service revolver is not loaded. I am still unsure of his ability to handle copious amounts of VP in one sitting. 88) But that is what offlines are for, to get to know folks in person where on line personas may not be as endearing. I am sure that of course, works both ways. :D

Post by Conky »

Fair comment Roy,

and sadly I would be at a grave disadvantage if Andy was there. We do not carry guns as a matter of course. I have other methods...but none of them out run a speeding bullet. :D
Alex Bridgeman
Fonseca 1966
Posts: 15922
Joined: 12:41 Mon 25 Jun 2007
Location: Berkshire, UK

Post by Alex Bridgeman »

Roy

One of the weaknesses of FTLOP at present is that there is a great wealth of tasting notes from the forum members, but no index or easy way to find a note for a particular port (this is the only reason that I do not post tasting notes on FTLOP very often - once I post them it is almost impossible to refer back to them after a few days have gone by). Jacob's tasting note search string could help you to fill this gap relatively quickly...

Alex
Top Ports in 2024: Niepoort 1900 Colheita, b.1971. A near perfect Port.

2025: Quevedo 1972 Colheita, b.2024. Just as good as Niepoort 1900!

Post by Conky »

For a Computer half-wit like myself, what stage are we up to? Any estimated time-scale, or is it just when a few of you get round to it (Which I completely understand)

Post by KillerB »

Conky wrote:For a Computer half-wit like myself, what stage are we up to? Any estimated time-scale, or is it just when a few of you get round to it (Which I completely understand)
Thank you for understanding.

Post by Roy Hersh »

Alex,

You are preaching to the choir. I've been begging a friend of mine for 9 months, if not longer. I'd be happy to have my buddy utilize Jacob's talents and hopefully that will happen at some point sooner rather than later.

Post by JacobH »

I’m currently in the midst of exams, which has had a somewhat negative effect on my ability to (in rough order of priority): a) consume Port, b) reply to PMs/emails (apologies!) and c) mess around with bits of computer code. However, I’m just about getting there with sorting this out…hopefully it’ll be done by the end of next week (hangovers caused by celebrating the end of the worst course ever devised notwithstanding).

Anyway, I could do with a little assistance with the names of the shippers and was wondering if people here could help me? I have below a list of the ones which I have included so far and I was wondering if I could have any corrections/additions/removals. The particular problem is that so many obscure bottles have been drunk on the internet that I am soon out of my depth in terms of knowledge!

I’m also hoping to include secondary labels and single quintas. The ones I have so far are marked with a +.

The single * denotes what I think is a non-Douro shipper and a ** denotes a BOB company.

Code: Select all

Adriano Ramos-Pinto
Aguias
Allesvorloren*
Antonion Jose Da Silva
Barros
Barão de Vilar
Berry Bros. Own Brand**
Bodega De Leon
Boplaas Cape*
Borges
Broadbent
Buller
Burmester
Butler
Calem
Carvalhas
Casal does Jordões
Champalimaud
Churchhill
+Quinta da Gricha
Cockburn
+Quinta dos Canais
Croft
Cruz
Dalva
Delaforce
Diez Hermanos
Dow
+Quinta do Bomfim
+Senhora da Ribeira
Dutschke*
Feist
Ferreira
Feurheerd** [? does this exist...I have no idea where I got the name!]
Fiin Gammel*
Fonseca
+Guimaraens
Gonzalez Bypass*
Gould Campbell
Graham
+Quinta dos Malvedos
Harvey
Hooper
Hutcheson
James Eadie
KWV
Kopke
Krohn
Landskroon*
Mackenzie
Martinez
Massandra Lavidia [? Another unknown, possibly a typo]
Messias
Moreira [note to self…check Moriera typo]
Morgadio da Calçada [not to self…check Morgado typo]
Morgan Brothers**
Napa*
Niepoort
+Secundum
Offley Forrester
Osborn
Penfold*
Pintas
Porto Pocas 
Porto Rocha
Pousada
Presidential
Quarles Harris
Quinta Infantado
Quinta Nova de Nossa Senhora do Carmo
Quinta Valle Longe
Quinta da Cavadinha
Quinta da Eira Velha
Quinta da Fonte Nova
Quinta da Foz
Quinta da Prelada
Quinta da Romaneira
Quinta da Ventozelo
Quinta de Baldias
Quinta de Brunheda
Quinta de Roriz
Quinta de la Rosa
Quinta do Castelinho
Quinta do Crasto
Quinta do Fojo
Quinta do Javali
Quinta do Mourão
Quinta do Noval
+Nacional
Quinta do Passadouro
Quinta do Portal
Quinta do Rominera
Quinta do Tedo
Quinta do Vale
Quinta do Vesuvio
Quinta do Vista
Qunita do Loureiro
Qunita do Sibio
Rebello
Romariz
Rovalley*
Royal Companhia
Royal Oporto
Rozès
Sandeman
Santa Eufemia
Smith Woodhouse
Symington
Sâo Pedro Dad Aguias
Taylor, Fladgate and Yeatman
+Quinta de Terra Feita
+Quinta de Vargellas
Thorn-Clarke
Vallegre
Warre
+Quinta da Cavadinha
Finally, I am also looking to produce a list of all the styles of Port which are ever likely to come up. This, hopefully, should be easier. I should mention that my main aim is to reflect the reality of what has been produced, rather than slavishly follow the IVDP guidelines. So, has anyone come across a style of Port which is not listed below? (Sub-categories are marked with a “+”.)

Code: Select all

Colheita
Crusted
Garrafeira
Late Bottled Vintage
Ruby
+Reserve
Single Quinta Vintage
Tawny
+10 Year Old
+20 Year Old
+30 Year Old
+40 Year Old
Vintage
White
+Colheita
+Dry  
+Extra Dry
+Lagrima
+Leve Seco
+Medium Sweet
+Sweet
Many thanks!

Post by DRT »

Jacob,

Jdaw1 is also in the process of compiling a definitive list of shippers' names and should be able to provide you with that list by email or PM or by posting it here.

On the styles of port I would suggest you add the following:

Vintage Character

Late Bottled Vintage
+Unfiltered
+Bottle Matured
+Traditional

Ruby
+Unfiltered
+Special Reserve

Pink (Yuck!)

White
+Colheita

Colheita
+White

[choose one or the other of the above for your list. I think the second version is more accurate]

Please also note that in the Tawny sub-categories it should be "Over 40 Years Old". The other 3 Tawny sub-categories are ok.

Derek

Post by JacobH »

Derek T. wrote:Jdaw1 is also in the process of compiling a definitive list of shippers' names and should be able to provide you with that list by email or PM or by posting it here.
Ah, that would be very helpful. Thanks for letting me know!

In terms of styles:

Vintage Character; an interesting suggestion as I think this term is quite often used to describe Rubies. I'll see if I can sort out the filters to get it to work, though. The only TN I can find for a non-ruby version of it was from an off-line last May.

Late Bottled Vintage; these would certainly make sense (as long as people put a note to that effect in their titles!)

Ruby; certainly can add "unfiltered". I wonder whether "special reserve" denotes anything of consequence beyond "reserve" or whether it would balkanise the Ruby results too much?

Pink (Yuck!); might follow the IVDP's lead on this one and put it under "Ruby"! :D

White Colheitas. Hmm...I did wonder about this one. I think the question is: what makes more sense for someone trying to look for White Colheita TNs. Would you look first under "White" or first under "Colheita"?

"Over 40 Years Old"; fair point (though, bizarrely, "40 years old" is correct across the pond!).

Thanks for the suggestions!
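One possible way to pick the tawny sub-category out of a title is a regular expression on the stated age. This is a sketch only: the function name `tawny_age` and the accepted spellings ("year", "yr", "yo") are assumptions, and it labels the oldest category "Over 40 Years Old" as suggested above.

```php
<?php
// Returns the tawny age sub-category found in a title, or null if none.
function tawny_age(string $title): ?string
{
    // Matches e.g. "10 Year Old", "20-year-old", "30 yrs", "40yo".
    // The leading \b stops "40" inside a vintage such as 1940 from matching.
    if (!preg_match('/\b(10|20|30|40)[\s-]*(?:years?|yrs?|yo)\b/i', $title, $m)) {
        return null;
    }
    return $m[1] === '40' ? 'Over 40 Years Old' : $m[1] . ' Year Old';
}
```

A title with only a vintage year, such as "Graham 1940", yields null, so it would fall through to the other style checks.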

Post by DRT »

JacobH wrote: Vintage Character; an interesting suggestion as I think this term is quite often used to describe Rubies. I'll see if I can sort out the filters to get it to work, though. The only TN I can find for a non-ruby version of it was from an off-line last May.
That is a TN of a ruby. Vintage Character is a now outlawed description for a style of premium ruby. It has now been replaced by the Reserve classification.

Derek

Post by Alex Bridgeman »

Jacob,

Do you also need us to provide you with all the variations that we have seen for these names? Although if the search code that you are writing is intended for the moment to only operate on TPF then we have been fairly strict with the naming conventions that we have used for our titles so that any search engine work is made as simple as possible.

Alex

Post by JacobH »

Alex, variations would be good: I intend half the script to be portable so it can be used for other projects (such as building a cellar database from a plain text list) where more liberal naming conventions will apply.

Also, despite everyone’s best efforts, I already need to include quite a few spelling corrections in my script to deal with common errors (particularly the over-zealous application of the “I before E” rule to Portuguese!)

-Jacob
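One common way to catch near-miss spellings without listing each one by hand is edit distance. The sketch below uses PHP's built-in levenshtein(); the function name `closest_shipper`, the sample list and the threshold of two edits are assumptions for illustration, and because it compares bytes, names are best stripped of accents first.

```php
<?php
// Hypothetical helper: map a possibly misspelt word onto the closest known
// shipper keyword, or return null if nothing is within $maxDistance edits.
function closest_shipper(string $word, array $known, int $maxDistance = 2): ?string
{
    $best = null;
    $bestDistance = $maxDistance + 1;
    foreach ($known as $candidate) {
        $d = levenshtein(strtolower($word), strtolower($candidate));
        if ($d < $bestDistance) {
            $best = $candidate;
            $bestDistance = $d;
        }
    }
    return $best;
}
```

Swapped letters, as in "Moriera", count as two edits, which is why the default threshold is two rather than one. A low threshold keeps short names from matching each other by accident.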
jdaw1
Dow 1896
Posts: 24574
Joined: 14:03 Thu 21 Jun 2007
Location: London

Post by jdaw1 »

I have code that will parse the names in our TN database.

Post by Alex Bridgeman »

JacobH wrote:I’m also hoping to include secondary labels and single quintas. The ones I have so far are marked with a +.

The single * denotes what I think is a non-Douro shipper and a ** denotes a BOB company.
Adriano Ramos-Pinto
or Ramos Pinto
Adams**
Aguias
Allesvorloren*
or Allesverloren or Allesveloren
Andresen

Antonion Jose Da Silva
or Antonio Jose Da Silva or AJ da Silva or AJS
Avery**
Barros
Barão de Vilar
Berry Bros. Own Brand**
or Berry Brothers & Rudd or BB&R or BBR
Bodega De Leon
Boplaas Cape*
or Boplaas
Borges
Bredell*
Broadbent
Buller
Burmester
Butler
Calem
Carvalhas
Casal does Jordões
or Casal dos Jordoes
Chammisso

Champalimaud
Churchhill
or Churchill
+Quinta da Gricha
Cockburn
+Quinta dos Canais
Constantino
Croft
+Quinta da Roeda
Cruz
Dalva
De Kraans*
Delaforce
Diez Hermanos
Dolamore
Dow
+Quinta do Bomfim
+Senhora da Ribeira
Dutschke*
Feist
Ferreira
Feurheerd** [? does this exist...I have no idea where I got the name!]
Feurheerd does exist but is spelt Feuerheerd, and misspellings are common
Fiin Gammel*
Fonseca
+Guimaraens
Gonzalez Bypass*
Gonzalez Byass is a Douro shipper
Gould Campbell
Graham
+Quinta dos Malvedos
Harvey** - were the sole UK agents for Martinez for a while
Hedges & Butler**

Hooper
Hutcheson
James Eadie** - a Birmingham based merchant
KWV* - a South African producer
Kopke
Krohn or Wiese & Krohn
Landskroon*
Mackenzie
Martinez
Massandra Lavidia [? Another unknown, possibly a typo]This is correct, but should be a * port as it comes from the Crimea
Messias
Moreira [note to self…check Moriera typo]
Morgadio da Calçada [not to self…check Morgado typo]
Morgan Brothers** Should not be ** - this is a shipper now owned by Taylors, but is frequently referred to simply as Morgan
Napa*
Niepoort
+Secundum
Offley Forrester or Offley or Offley Boa Vista or Boa Vista
Osborn or Osborne
Overgaauw*

Penfold*
Pintas
Porto Pocas or Pocas
Porto Rocha or Rocha
Pousada
Presidential
Quarles Harris
Quinta Infantado more correctly, Quinta do Infantado
Quinta Nova de Nossa Senhora do Carmo more correctly Quinta da...
Quinta Valle Longe more correctly, Quinta de...
Quinta da Cavadinha
Quinta da Eira Velha
Quinta da Fonte Nova
Quinta da Foz
Quinta da Prelada
Quinta da Romaneira
Quinta da Ventozelo
Quinta de Baldias
Quinta de Brunheda
Quinta de Roriz
Quinta de la Rosa
Quinta do Castelinho
Quinta do Crasto
Quinta do Fojo
Quinta do Javali
Quinta do Mourão
Quinta do Noval or QdN
+Nacional or QdNN or NN
+Silval or Noval Silval

Quinta do Passadouro
Quinta do Portal
+"Portal+" or "Portal +" - "Portal+" is Portal's answer to Sandeman Vau
Quinta do Rominera or Romaniera
Quinta do Tedo
Quinta do Vale
Quinta do Vesuvio
Quinta do Vista
Qunita do Loureiro
Qunita do Sibio
Quinta da Silval (or is it do or de...?)
Rebello
Romariz
Rovalley*
Royal Companhia
Royal Oporto
Rozès
Sandeman
Santa Eufemia
Smith Woodhouse
Symington
Sâo Pedro Dad Aguias
Taylor, Fladgate and Yeatman or Taylor
+Quinta de Terra Feita
+Quinta de Vargellas
++Quinta de Vargellas Vinho Velha
Thorn-Clarke
Vallegre
Warre
+Quinta da Cavadinha
My main suggestion would be to remove the "Quinta da/de/do/dos" from all the names that currently contain them. The only reason for this suggestion is that these are frequently typed wrong, so you would have to be able to identify all the variations on a theme - eg. Quinta da Vesuvio, Quinta de Vesuvio, Quinta do Vesuvio or Quinta dos Vesuvio. It's probably easier just to search out "Vesuvio".
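As a sketch of how the variants above could be folded onto one name, the example below matches whole-word aliases and ignores the da/de/do/dos entirely. The alias table is a small hand-picked sample and `canonical_shipper` is a hypothetical name, not part of the existing script.

```php
<?php
// Illustrative alias table built from a few of the variants listed above:
// canonical name => regular-expression alternatives (lower case).
function canonical_shipper(string $title): ?string
{
    $aliases = [
        'Quinta do Noval'       => ['noval', 'qdn'],
        'Quinta do Vesuvio'     => ['vesuvio'],
        'Berry Bros. Own Brand' => ['berry bro(?:s|thers)', 'bb&r', 'bbr'],
    ];

    foreach ($aliases as $name => $patterns) {
        // \b on both sides so that a short alias such as "bbr" cannot
        // match inside a longer word.
        $regex = '/\b(?:' . implode('|', $patterns) . ')\b/i';
        if (preg_match($regex, $title)) {
            return $name;
        }
    }
    return null;
}
```

Since only "Vesuvio" is matched, "Quinta da Vesuvio", "Quinta de Vesuvio" and "Quinta dos Vesuvio" all come back as "Quinta do Vesuvio".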

Post by JacobH »

Thanks for that. That’s extremely helpful.

The “Quinta” issue is not particularly serious as the script looks for the main notable word in the title (e.g. “Noval”, “Quinta Da Noval” and even something like “Chateux Noval” would all produce “Quinta Do Noval” as a result) so any mistakes in Portuguese grammar will be ignored!

The three names which I’m a bit worried about are: Ramos-Pinto, Porto Pocas and Porto Rocha. I think the full name “Adriano Ramos-Pinto” is used so rarely that it can safely be filed under “R”, but I’m less sure about which of Porto Pocas/Pocas and Porto Rocha/Rocha is most common. For the former, I suppose it doesn’t really matter as it will end up in pretty much the same place, but I’m less sure about Porto Rocha.

-Jacob

Post by jdaw1 »

JacobH wrote:Porto Pocas/Pocas
Poças, surely. (Though my code strips away all diacritical marks before comparison, adding them back for recommended names.)

Post by JacobH »

jdaw1 wrote:
JacobH wrote:Porto Pocas/Pocas
Poças, surely. (Though my code strips away all diacritical marks before comparison, adding them back for recommended names.)
Ah, yes, indeed :D

That’s the approach I’ve been taking, though I think there might be some clever PHP function which does it automatically. I might have a look into that!

I think I’m just about getting there, having written it from scratch for the second time… The maxim “You have a problem and decide to solve it by XML. You now have two problems.” comes to mind :roll:

-Jacob
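The strip-then-compare approach described above can be sketched with an explicit character map. This is an illustration only: the map covers just a handful of accented letters of the kind seen in the shipper list, the function names are made up, and the source file is assumed to be saved as UTF-8.

```php
<?php
// Explicit map covering only a few accented letters; a real script would
// need a fuller table (or a library routine, which is not assumed here).
function strip_diacritics(string $name): string
{
    return strtr($name, [
        'ã' => 'a', 'â' => 'a', 'á' => 'a',
        'ç' => 'c',
        'è' => 'e', 'é' => 'e',
        'õ' => 'o', 'ó' => 'o',
    ]);
}

// Comparison key: accents removed, lower-cased. The display name is kept
// separately so that the accents can be shown in the results.
function comparison_key(string $name): string
{
    return strtolower(strip_diacritics($name));
}
```

Two spellings are then treated as the same shipper when their keys are equal, while the recommended name (with accents) is what gets displayed.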

Post by JacobH »

Finally, I think I’ve made some progress on this script and turned it into something which might be useful.

I’ve uploaded a new version to http://www.jacob-head.com/tpf and would be grateful for any feedback.

The main changes are:
1) Additions to and corrections of the details of a number of producers.
2) Ditto for various Port styles.
3) New search facility (so you can search for all/any of the words in a phrase).
4) The drop-down menus can now be combined with the search box (i.e. you can now search for a term within all Colheitas).
5) Switching to XPath queries rather than loops, which has resulted in it being much faster.

Once everyone is happy with how it looks, I can then add the necessary bits of code to make it automatically update when someone posts.

Things that could be added include: original poster’s name; date and time of the post; highlighting of non-Douro/BOB shippers and changes to the formatting. Let me know what you think about these.

-Jacob
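For illustration, the combination of a drop-down filter and a search term can be expressed as a single XPath query instead of a loop over every topic. The sketch assumes the element names from the XML shown earlier in the thread; the sample topics and the function name `find_topics` are made up. Note that XPath's contains() is case-sensitive.

```php
<?php
// Made-up sample data in the same shape as the stage-two XML.
$xmlText = <<<XML
<forum>
  <topic><name>Cruz Ruby</name><type>Ruby</type><year>nv</year><url>http://www.xyz.com/1</url></topic>
  <topic><name>Krohn 1978 Colheita</name><type>Colheita</type><year>1978</year><url>http://www.xyz.com/2</url></topic>
  <topic><name>Kopke 1978 Colheita</name><type>Colheita</type><year>1978</year><url>http://www.xyz.com/3</url></topic>
</forum>
XML;

// Returns the names of topics of the given type whose name contains $term.
function find_topics(SimpleXMLElement $forum, string $type, string $term): array
{
    // Assumes the inputs contain no double quotes: XPath 1.0 string
    // literals have no escape syntax for them.
    $query = sprintf('//topic[type="%s"][contains(name, "%s")]', $type, $term);
    $names = [];
    foreach ($forum->xpath($query) as $topic) {
        $names[] = (string) $topic->name;
    }
    return $names;
}

$forum = simplexml_load_string($xmlText);
$colheitaKrohn = find_topics($forum, 'Colheita', 'Krohn');
```

Each extra drop-down would simply add another bracketed predicate to the query string.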

Post by JacobH »

Drat, I’ve just noticed that “Nacional” without “Noval” is now not being picked up, for some reason. I’ll correct that when I get a chance (I think it was caused by not wishing to get false positives on single-variety Touriga Nacional ports).
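One possible way to accept a bare “Nacional” while still rejecting the grape name is a negative lookbehind. This is a guess at an approach, not the script's actual fix: `mentions_noval_nacional` is an invented name, and the pattern only handles a single space after “Touriga”.

```php
<?php
// Hypothetical check: treat "Nacional" as Quinta do Noval Nacional unless
// it is immediately preceded by "Touriga " (the grape variety).
function mentions_noval_nacional(string $title): bool
{
    // (?<!touriga ) is a negative lookbehind: the match is rejected when
    // the text just before "nacional" is "touriga " (case-insensitive).
    return preg_match('/(?<!touriga )\bnacional\b/i', $title) === 1;
}
```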

Post by Conky »

Just had a play with it....Excellent.

I would love a score option, but it's very good as it is.

If I've praised it enough? A suggestion. The look of it needs jazzing up. That funny paisley design doesn't do it for me, and it might benefit from more stylish and distinctive font. But of course that is secondary to doing its job, which it does.

Alan.
RonnieRoots
Fonseca 1980
Posts: 1981
Joined: 07:28 Thu 21 Jun 2007
Location: Middle Earth

Post by RonnieRoots »

Great work Jacob!

I would like to see the poster's name and original post date included. That would give a better overview if there is more than one result for a specific port. A couple of remarks on producer names:

Gonzalez Bypass (although funny) must be Gonzalez Byass

Osborne

Poças is listed twice, use either Poças or Porto Poças

Quinta do Infantado

Quinta do Vale D. Maria

Quinta do Vista must be Vista Alegre
(this is not a single quinta wine)

That's all!

Post by JacobH »

Conky wrote:Just had a play with it....Excellent.

I would love a score option, but it's very good as it is.

If I've praised it enough? A suggestion. The look of it needs jazzing up. That funny paisley design doesn't do it for me, and it might benefit from more stylish and distinctive font. But of course that is secondary to doing its job, which it does.

Alan.
Thanks! I haven't put any thought into the styling of it, though that is very easy to change. At the moment, it's just using the basic house-style of the website where it is stored, which looks pretty horrific if you are using Internet Explorer rather than Firefox. If we put it on :tpf: we can easily incorporate the current style of the site, or do something different.

Incidentally, the "funny paisley design" is an original tile design by Augustus Pugin from The True Principles of Pointed, or Christian Architecture :P

In terms of the score, that's a bit difficult to do automatically, because the scores aren't in an obvious (for a computer) place. We could look at doing something more complex to try to resolve that (e.g. by manually adding them for existing notes and then having them in a specific place for new ones) but that might have to wait until a version 2.
Last edited by JacobH on 13:51 Tue 17 Jun 2008, edited 2 times in total.
JacobH
Quinta do Vesuvio 1994
Posts: 3300
Joined: 15:37 Sat 03 May 2008
Location: London, UK

Post by JacobH »

RonnieRoots wrote:Great work Jacob!

I would like to see the poster's name and original post date included. That would give a better overview if there is more than one result for a specific port. A couple of remarks on producer names:

Gonzalez Bypass (although funny) must be Gonzales Byass

Osborne

Poças is listed twice, use either Poças or Porto Poças

Quinta do Infantado

Quinta do Vale D. Maria

Quinta do Vista must be Vista Alegre
(this is not a single quinta wine)

That's all!
Thanks! I'll make those changes when I'm next at my computer.

I think this has nicely demonstrated that I can neither spell nor type :D
Alex Bridgeman
Fonseca 1966
Posts: 15922
Joined: 12:41 Mon 25 Jun 2007
Location: Berkshire, UK

Post by Alex Bridgeman »

I think this is great.

My only request is for another sort button - by vintage.

I do note that where there is a thread (as opposed to a single posting on a tasting) then there can be repeats of the post in the index that comes back.

To illustrate this, try doing a search on "Berry Brothers Selection" and "Vintage Port" as the type and take a look at the two notes that come back for the 1970 vintage.

Thanks!
Top Ports in 2024: Niepoort 1900 Colheita, b.1971. A near perfect Port.

2025: Quevedo 1972 Colheita, b.2024. Just as good as Niepoort 1900!
JacobH
Quinta do Vesuvio 1994
Posts: 3300
Joined: 15:37 Sat 03 May 2008
Location: London, UK

Post by JacobH »

Alex, I didn't think there would be much interest in that functionality, but it should be quite easy to sort that out.

As for the second issue, I don't think there's any easy way around it but I don't think there are too many duplicates in the forum. Perhaps it will be improved once I add dates/original poster?
JacobH
Quinta do Vesuvio 1994
Posts: 3300
Joined: 15:37 Sat 03 May 2008
Location: London, UK

Post by JacobH »

I’ve had a chance to make the changes which have been suggested and re-uploaded it to the same place: http://www.jacob-head.com/tpf.

The “by vintage” function doesn’t work entirely as I would wish, but I think it’s generally ok. The post’s author and the date of the post don’t appear on my website because the data I am using to generate the index doesn’t include them, but they should appear once it is uploaded to :tpf:.
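
As a rough illustration of a sort by vintage, here is a small Python sketch. It is an assumption about how it could be done rather than the script's own code: the `Note` record and its field names are made up for the example, and notes with no vintage (a tawny, say) are simply placed at the end.

```python
from typing import List, NamedTuple, Optional


class Note(NamedTuple):
    """Hypothetical record for one indexed tasting note."""
    producer: str
    vintage: Optional[int]  # None when the note has no vintage year
    port_type: str


def sort_by_vintage(notes: List[Note]) -> List[Note]:
    """Oldest vintage first; undated notes last; ties broken by producer."""
    return sorted(
        notes,
        key=lambda n: (n.vintage is None, n.vintage or 0, n.producer),
    )
```

Putting `n.vintage is None` first in the key is what pushes undated notes to the bottom, since `False` sorts before `True`.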

I’ve also added a little JavaScript which, once a producer is selected, blanks out the years and types which do not exist for that producer. (This might be a bit bandwidth-heavy, though, so it may have to be removed.)
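
To give an idea of the data this kind of filtering needs, here is a hedged Python sketch that builds a producer lookup and serialises it as compact JSON for the page to use. The function names and the `(producer, vintage, type)` tuple layout are assumptions for illustration, not the actual implementation.

```python
import json
from collections import defaultdict
from typing import Dict, Iterable, List, Optional, Tuple

# Hypothetical input row: (producer, vintage or None, port type).
Row = Tuple[str, Optional[int], str]


def build_producer_index(rows: Iterable[Row]) -> Dict[str, Dict[str, List]]:
    """Map each producer to the years and types that exist in the index."""
    seen = defaultdict(lambda: {"years": set(), "types": set()})
    for producer, vintage, port_type in rows:
        if vintage is not None:
            seen[producer]["years"].add(vintage)
        seen[producer]["types"].add(port_type)
    # Convert sets to sorted lists so the result can be written as JSON.
    return {
        producer: {
            "years": sorted(entry["years"]),
            "types": sorted(entry["types"]),
        }
        for producer, entry in sorted(seen.items())
    }


def to_compact_json(index: Dict[str, Dict[str, List]]) -> str:
    """Serialise without extra whitespace to keep the download small."""
    return json.dumps(index, separators=(",", ":"))
```

Sending one compact lookup with the page, rather than asking the server each time a producer is picked, is one way the bandwidth cost might be kept down.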

Finally, I’ve added the bits of code necessary to make it automatically update, so, perhaps we could look at moving to the next stage (of setting it up on the :tpf: servers) unless there are any more comments or changes? (Which would be more than welcome :))

-Jacob