Wikipedia:WikiProject Chemistry/IRC discussions/5 Feb 2008
From Wikipedia, the free encyclopedia
--- Log opened Tue Feb 05 10:32:35 EST 2008
10:38 <+Physchim62> morning dmacks :)
10:38 <dmacks> hello
10:38 <dmacks> (not quite awake yet, waiting for coffee to brew:)
10:39 * Physchim62 passes dmacks a cup of coffee
10:39 <dmacks> *phew*, that's better. Thanks!
10:39 <+Physchim62> be careful, it was made with a Soxlet ;)
10:41 * Physchim62 can never remember how to spell that
10:42 <dmacks> hehe yeah. I think there's an "h" in it or something?
10:45 -!- ChemSpiderMan [n=ChemSpid@c-68-33-151-242.hsd1.md.comcast.net] has joined #wikichem
10:46 <ChemSpiderMan> I will be here but am going on to a conference call at 11am EST (the time this meeting starts) so my contributions will only start when I get off the call I'm afraid
10:46 <+Physchim62> ok
10:46 <ChemSpiderMan> Did you see all of my PDFs?
10:47 * dmacks didn't get a chance to look this week.
10:47 <+Physchim62> I've not seen the latest ones, but
10:47 <+Physchim62> I've seen up until about ten days ago, I think
10:47 <ChemSpiderMan> http://www.chemspider.com/filex/Wikipedia_reports/Wikpiedia_Review_project_020108.pdf
10:48 <+Physchim62> that one I've got
10:48 <+Physchim62> no, I haven't
10:48 * dmacks d/l & reads....
10:48 <ChemSpiderMan> I'm putting further work on hold until we make some decisions around the following:
10:49 <ChemSpiderMan> Representation of sugars
10:49 <+Physchim62> fire away
10:49 <dmacks> "HTTP401-permission denied" on that file.
10:49 <ChemSpiderMan> Primary keys - article versus structure versus CAS and CAS validation (topic for today)
10:49 <ChemSpiderMan> hmmm....
10:49 <ChemSpiderMan> Andrew...can you get to it?
10:50 <+Physchim62> I'm worried about the questions of protonation, esp. on biochemicals, which I think we will be discussing today as well
10:50 <ChemSpiderMan> dmacks...email address? I'll send it to you
10:50 <dmacks> dmacks@towson.edu
10:50 <dmacks> thx
10:51 <ChemSpiderMan> sent
10:51 <ChemSpiderMan> ok...yes: protonation is important
10:51 <+Physchim62> and difficult
10:51 <ChemSpiderMan> Hmmm...I think the majority is pretty easy actually.
10:52 <ChemSpiderMan> There are some problematic cases but it's 80/20
10:52 <+Physchim62> I sometimes get the impression that we're looking for a solution that -chemically- isn't there
10:52 <+Physchim62> especially for biochemicals which attached phosphate groups!
10:52 <+Physchim62> *with
10:53 <ChemSpiderMan> I hear ya....
10:53 <+Physchim62> ie, we will *never* be able to get a unique InChI or CAS, because the compound exists as a eqm mixture ;)
10:54 <ChemSpiderMan> But this again is where we come down to whether or not we are creating the tie to the structure or not
10:54 <+Physchim62> actually, CAS is simpler than InChI in that respect: CAS doesn't care about the chemical nature of the substance it describes ;)
10:54 <ChemSpiderMan> I mean...what's the structure of water?
10:54 <ChemSpiderMan> It's not just H2O
10:55 <ChemSpiderMan> same issue for me
10:55 <ChemSpiderMan> That's why a decision has to be made.
10:55 <ChemSpiderMan> For sugars...we can draw them many ways.
10:56 <ChemSpiderMan> What way should we "decide on" so we can get it done
10:56 <ChemSpiderMan> Staying stuck is going to be easy :-)
10:56 <dmacks> yeah:/
10:56 <ChemSpiderMan> I have 150 molecules now in PDFs for CAS validation...
10:57 <+Physchim62> but "get it done" has a sound of set in stone... there are four cyclic structures which can accurately be described as glucose, how should we present them?
10:58 <+Physchim62> D-glucose is "more important" than L-glucose, but an encyclopedia needs to say what the difference is!
10:58 <dmacks> Comes back to "what are we actually talking about" by "a chemical"...structure, group of structures (mixtures), group of structures (different forms of same structure), etc.
10:58 <ChemSpiderMan> we should present them as we "decide to"
10:58 <ChemSpiderMan> Because if we don't decide it won't get done and we stay where we are
10:58 <ChemSpiderMan> which is better?
10:59 <+Physchim62> agreed, it is an "editorial decision", there is no shame in admitting it!
10:59 <ChemSpiderMan> I have to make decisions like this about 20 million molecules.
10:59 -!- walkerma [n=chatzill@admin-151-108.potsdam.edu] has joined #wikichem
10:59 <ChemSpiderMan> I ask for opinions but then have to decide and move forward
10:59 <ChemSpiderMan> Martin...hi...
10:59 <dmacks> hello walkerma
10:59 <walkerma> Hello!
11:00 <ChemSpiderMan> Just to let you know I will be here but am on a conf call at 11am and will only be able to watch...will participate when I get off..
11:00 <ChemSpiderMan> the call
11:00 * dmacks will put the past few minutes of "hallway discussion" in the log.
11:00 <walkerma> OK, I understand
11:01 <+Physchim62> dmacks, you're logging?
11:01 <dmacks> yup
11:01 <+Physchim62> good :)
11:02 <walkerma> Does anyone know if Rifleman82 is coming? Itub?
11:02 <ChemSpiderMan> oops..I thought Andrew was on here...no wonder he didn't answer :-)
11:05 <+Physchim62> no idea
11:06 <walkerma> OK, I've sent Andrew a quick email. Should we get started?
11:06 * Physchim62 proposes that Martin takes the Chair ;)
11:06 <dmacks> Itub was active on WP a minute a go, so might be along...
11:07 <walkerma> OK, I just left Itub a message.
11:07 <walkerma> I was wondering if we could discuss CAS # validation
11:07 <walkerma> which seems very challenging for us
11:07 <walkerma> How should we proceed?
11:07 <+Physchim62> what is the challenge?
11:08 <ChemSpiderMan> Phone call is delayed until 11:15am..I am here. The challenge is validation. We have CAS numbers associated with structures. Are they correct
11:08 <walkerma> The challenge: We have at least 6000 compounds to check against original ACS information (not the Aldrich catalogue!)
11:09 <walkerma> It is a very slow process, and most of us (me included) don't have good access to SciFinder
11:09 <+Physchim62> no, we have CAS numbers *and* structures associated with *compounds*
11:09 <ChemSpiderMan> Searching on the CSA number to check the structure is the wrong way around also
11:09 <ChemSpiderMan> ASSUMING the structure/compound/substance shown in the Chembox is correct then what is a "representative" CAS number
11:10 <walkerma> Sorry, I'm still in InChI mode - thanks for the correction Antony. Are there other (legal ways) we can get CAS #s for SUBSTANCES?
11:10 <walkerma> besides Scifinder? That would be faster?
11:10 <+Physchim62> yes, there are 1001 ways to get them for /most/ of the compounds on WP
11:10 <ChemSpiderMan> there's only one "trusted way" I believe
11:11 <ChemSpiderMan> search the CAS registry using one of their tools
11:11 <+Physchim62> CAS #s are used as identifiers by many, many organizations which place info on the web: sometimes they get them wrong, of course, but we needn't be restricted to the CAS database
11:11 <ChemSpiderMan> I agree that there are good sources online. But how to sort the wheat from the chaff?
11:12 <ChemSpiderMan> CAS is the "authority"
11:12 <+Physchim62> if the "wrong" CAS no. is widely used, that is a piece of info which should be included in WP
11:12 <walkerma> I disagree, PC: If we want a VALIDATED set of data we should go to the authority
11:12 <ChemSpiderMan> it's expensive and time consuming to do though
11:12 <dmacks> "also often (incorrectly) listed as 12345-67-8"?
11:13 <dmacks> Needn't be part of our validated dataset, but useful note for WP.
11:13 -!- itub [n=tubert@lalo.chemie.unibas.ch] has joined #wikichem
11:13 -!- mode/#wikichem [+v itub] by ChanServ
11:13 <walkerma> dmacks: That would be nice, but the most important thing is getting the main CAS# right
11:13 <+Physchim62> replace "incorrectly" by "which is the number for LSD"
11:13 <dmacks> Right.
11:14 <walkerma> Itub: Welcome! We're talking about validation of CAS#s on WP
11:14 <+itub> hi
11:14 <+Physchim62> hi itub
11:14 * dmacks agrees that the real data is more important than other stuff.
11:15 <walkerma> I chatted with PC earlier, and wondered how they do this at ACS Pubs and at CAS
11:15 <+Physchim62> I disagree. our users are going to use these numbers to search for data elsewhere: if they use SciFinder, they will realise the mistake, but if they use other databases how would they know that the number was "wrong" if we're using the nulber which is normally used?
11:15 <walkerma> Does anyone know what tools THEY use? They have a lot more than 6000 to do
11:16 <walkerma> Physchim62:Fair point
11:18 <+itub> they probably have people working full time on that
11:18 <+itub> doesn't sound like a fun job
11:18 <ChemSpiderMan> don't understand the question? How does ACS pubs get CAS numbers?
11:18 <+Physchim62> we risk opening up a new field of dispute similar to the ones we had over the "correct" IUPAC name.
11:18 <+Physchim62> they ask CAS to assign them a number
11:18 <ChemSpiderMan> ah yes...for sure
11:19 <+Physchim62> there are CAS numbers for compounds which have never been published
11:19 <ChemSpiderMan> propehtic compounds
11:19 <ChemSpiderMan> prophetic.
11:19 <ChemSpiderMan> specifically for patents
11:19 <+Physchim62> soon to be patented compounds....
11:20 <+Physchim62> and also, for compounds whose description is still in the bowels of a journal's editorial office ;)
11:20 <ChemSpiderMan> yup
11:20 <walkerma> The question is: I'm at CAS. If I'm abstracting the 200 compounds described in a J. Org Chem paper, how do I check the CAS nos. for ths#s for those 200?
11:20 <walkerma> How do I know if any of them are new?
11:20 <ChemSpiderMan> draw them
11:21 <ChemSpiderMan> and search them
11:21 <+Physchim62> I believe they also assign numbers to hypothetical compounds, but I have no source for that
11:21 <walkerma> They have to be able to do that with lightning speed.
11:21 <ChemSpiderMan> Of course...i don't work there so I am making this up
11:21 <+itub> that's why people at CAS invented canonicalization
11:21 <+Physchim62> CAS has a *lot* of people working for it ;)
11:21 <walkerma> Yes, but surely they've streamlined things for efficiency?
11:22 <ChemSpiderMan> not really...when an article is submitted many come as images in documents and the OLE objects can easily be opened in an editor in general
11:22 <ChemSpiderMan> So, a word document with embedded Chemdraw or ISIS files can be opened in a structure editor.
11:22 <ChemSpiderMan> Word document ARE structure searchable...
11:22 <+Physchim62> I could check 200 compounds in a morning's work
11:22 <walkerma> They always ask for your original ChemDraw files - do they use those over at CAS?
11:23 <ChemSpiderMan> Provided the structures are embedded from an OLE compatible drawing package
11:23 <ChemSpiderMan> Yes...ACS blesses ISIS and ChemDraw
11:23 <ChemSpiderMan> But...if we want to know we should ask them :-)
11:23 <walkerma> I sent a couple of emails to ACS people I know, but got no reply as yet
11:23 <+Physchim62> we already have, we're waiting for /their/ reply ;)
11:24 <walkerma> The two people I contacted tend to be very busy people, so it doesn't mean they don't want to help
11:24 <walkerma> I know someone less well from CAS, who I'll contact too
11:25 <walkerma> Does anyone else here have useful contacts who might be able to answer our questions, and/or help?
11:25 <ChemSpiderMan> Just fyi structrue IMAGES can be converted too...but I don't know whether CAS do that: http://www.chemspider.com/blog/clide-and-more-complexity-chemspider-has-agents-and-eyeballs.html
11:25 <+Physchim62> but which numbers do we want to use?
11:25 <ChemSpiderMan> yes...back on topic...
11:26 <+Physchim62> CAS numbers are *not* unique
11:26 <+Physchim62> (nor could they be)
11:26 <walkerma> Before we leave validation:
11:26 <walkerma> Can we agree that we
11:26 <+Physchim62> some numbers are taken "out of service" on a regular basis
11:27 <walkerma> (a) Try to comeup with a quicker way to get CAS #s from an ACS source then
11:27 <ChemSpiderMan> The biggest part of the problem ca be solved by getting the CS number for the "structure/compound" as drawn. MOST of the "organic structures" I am curating likely have one CSA number
11:27 <walkerma> (b) If that fails, try and plod through with Scifinder where possible
11:27 <+Physchim62> (a) any "reliable source"
11:28 <+Physchim62> (b) is the point I was trying to get on to
11:28 <walkerma> (c) If all else fails use "any reliable source" such as ChemSpider?
11:28 <+Physchim62> if possible, multiple reliable sources
11:28 <ChemSpiderMan> No...please don't! While I am flattered we are definitely NOT an authority for CAS
11:28 <ChemSpiderMan> on phone now
11:29 <walkerma> OK, (c) could perhaps be multiple sources
11:30 <walkerma> Can we agree on that as a strategy?
11:30 <ChemSpiderMan> okay..back...call rescheduled.
11:30 <ChemSpiderMan> The problem you will have with multiple sources is MOST are contaminated
11:30 <ChemSpiderMan> I've tried this approach
11:30 <+itub> I think the only sure way is to check each with scifinder
11:31 <dmacks> Annotate the CAS# with "from whatever" if can't find authoritative source?
11:31 <ChemSpiderMan> You can find numbers but going back to your comment Martin...if you want them validated you need to go to the authority
11:31 <walkerma> dmacks: Yes that sounds like a good compromise
11:31 <ChemSpiderMan> yes...agree with DMACKS
11:31 <+itub> since cas numbers are opaque they are very often not checked for accuracy
11:31 <ChemSpiderMan> if you are willing to annotate that way that would work
11:32 <walkerma> ChemSpiderMan: What would you recommend as some of the more reliable non-ACS sources?
11:32 <dmacks> Alternative, have separate cas= and cas_unvalidated= fields in the infobox, then can know which aren't authoritative (and even search for them if one has some time and SciFinder access)
11:32 <walkerma> dmacks:Yes!
11:33 <+itub> good idea
11:33 <ChemSpiderMan> works for me
11:33 <+itub> better call them cas and cas_validated
11:33 <walkerma> yes
11:33 <+itub> so the default for existing cases is not validated
11:33 <dmacks> yeah
11:34 <ChemSpiderMan> for sure
11:34 <+itub> has there been any discussion of the legal issues? CAS says that one can only use up to 10,000 numbers for free...
11:35 <ChemSpiderMan> http://www.chemspider.com/blog/how-many-electronic-databases-have-more-than-10000-cas-numbers.html
11:35 <ChemSpiderMan> They really can't hold to this at this point
11:36 <ChemSpiderMan> They have to "enforce" such rules historically in order to apply them now
11:36 <+Physchim62> itub, this has been discussed at length in the past, it doesn't hold up
11:36 <ChemSpiderMan> PubChem has way more than 10000
11:36 <+itub> ok
11:36 <+Physchim62> Feist v. Rural is the U.S. copyright case
11:36 <+Physchim62> among others
11:37 <+itub> IMO it is a case of "you can copyright phone numbers"
11:37 <+itub> can't
11:37 <ChemSpiderMan> wow...great reference! Thanks...
11:37 <dmacks> Yeah!
11:38 <+Physchim62> you can't copyright *a collection* of phone numbers (unless your collection is particularly original, which CAS's isn't)
11:38 <walkerma> If we want ACS's help, we may want to ask them nicely for permission..!
11:39 <+Physchim62> WP's originality is in our selection of compound's, which is our own and which is licensed under th GFDL
11:39 <walkerma> We are a non-profit, after all
11:40 <ChemSpiderMan> Good luck...again, read this one later but I doubt you'll get support. i HOPE I am wrong.... http://www.chemspider.com/blog/intention-to-scrape-crystaleye-content-and-staying-in-relationship-with-publishers.html
11:40 <ChemSpiderMan> I cannot get an answer from the copyright group so a simple question
11:41 <+Physchim62> in short, there are no probs for the WMF: there might be problems for individual contributors, especially in Europe, but even here the risk is very limited
11:41 <walkerma> OK: Are there other CAS# issues we need to resolve today?
11:41 <ChemSpiderMan> so, CAS validated and CAS in the box then?
11:41 <walkerma> I think we can all agree on that
11:41 <ChemSpiderMan> Do we agree that we are out to get a CAS number consistent with the structure/compound shown?
11:42 * dmacks nods
11:42 <walkerma> Yes, I was hoping we could agree on that too
11:42 <ChemSpiderMan> so, a particular tautomer has the CAS for that tautomer
11:42 <+Physchim62> can I make I plea for us to use CAS numbers which relate to the title of the article, rather than necessarily the structure shown in the chembox?
11:42 <walkerma> That way it matches with the InChI, the SMILES, the IUPAC name
11:42 <walkerma> AND THE TITLE OF THE ARTICLE!
11:42 <walkerma> All should match!
11:42 <+Physchim62> on WP, the *title* is the primary key
11:42 <ChemSpiderMan> I don't think that's going to work
11:43 <ChemSpiderMan> I agree the title is the primary key
11:43 <+Physchim62> matching everything to the title is an impossibility in some cases
11:43 <ChemSpiderMan> agree
11:43 <ChemSpiderMan> glucose
11:43 <walkerma> Yes, there will be a problem in some cases
11:43 <ChemSpiderMan> The InChI is a form of the structure
11:43 <ChemSpiderMan> The SMILES is a form of the structure
11:44 <ChemSpiderMan> They convert to the structure'they do not convert to the title
11:44 <walkerma> So we need some clear guidelines in cases like Glucose, Tartaric acid
11:44 <ChemSpiderMan> The IUPAC name is for the structure
11:44 <+Physchim62> if there is a /generic/ CAS no for CoA, we should use that /regardless/ of the exact structure we show in the chembox. in an ideal world, we would also explain the problem in the article text...
11:44 <dmacks> I think a given cheminfobox should be completely self-consistent (incuding
International Chemical Identifier | |
---|---|
InChI= | |
InChIKey= | |
CASRN= | |
PIN= |
and other chembox-like stuff I guess) as a factual matter, than it's a WP editorial matter which infobox(es) go on each page.
11:44 <ChemSpiderMan> what about "CAS for structure" and "Generic CAS"
11:44 <ChemSpiderMan> yes dmacks. yes
11:45 <ChemSpiderMan> the chembox should be self-consistent
11:45 <walkerma> How about: We aim to match all of them wherever possible
11:45 <+Physchim62> some chemboxes cannot be self-consistent, because they include data for multiple compounds
11:45 <walkerma> But when they don't, we do what CHemSpiderMan suggested
11:46 <walkerma> One CAS to match the article name, another to match the structure drawing/InChI/SMILES/IUPACName
11:46 <walkerma> Made clear which is which
11:46 <+Physchim62> don't forget that the CAS number for the structure could (in principle) be included as metadata in the image
11:47 <walkerma> But the way we display it should be standardised
11:47 <+Physchim62> hard to do in practice, I'm well aware!
11:47 <walkerma> We read the metadata and display it as CAS for structure
11:48 <+Physchim62> why do we need to display it?
11:48 <dmacks> Yeah, /me was just about to say "more metadata that isn't derived from structure raises barrier to editors"
11:48 <dmacks> (metadata _in image file_)
11:48 <walkerma> Beetstra: Are you there?
11:48 <+Physchim62> we should be displaying the data which isn't immediately obvious, which in this case seems to be the /generic/ CAS number
11:49 <walkerma> But we don't want to show (say) a particular form of D-glucose in the diagram, then only display the CAS# for generic glucose
11:49 <+Physchim62> WP exists to impart useful information, not as an exercise in self-consistency: at least, it is more successful in impartint useful information than as an exercise in self-consistency ;)
11:50 <+Physchim62> why not? the article is about glucose, after all, not just about alpha-D-glucose
11:50 <ChemSpiderMan> and this is where we are back to a "decision" :-)
11:51 <walkerma> Because then the reader will think that alpha-D-glucose has the CAS # shown just below
11:51 <ChemSpiderMan> for glucose the structure shown should have no stereo...
11:51 <ChemSpiderMan> the particular forms of glucose can be shown as alpha and beta and they have their own CAS
11:51 <ChemSpiderMan> but that would be in the article
11:52 <ChemSpiderMan> not the Cbox
11:52 <walkerma> Or else in some cases we show more than one form, as at Carvone
11:52 <walkerma> http://en.wikipedia.org/wiki/Carvone
11:52 <+Physchim62> ChemSp^iderMan, showing glucose with no stereo only makes the problem six times bigger!
11:53 <dmacks> 2^6?
11:53 <ChemSpiderMan> Did you see the discussions that happened about glucose already?
11:53 <+itub> I think several cas numers in the infobox are ok
11:53 <+itub> if there are too many, add a table somewhere else
11:53 <ChemSpiderMan> MArtin...do you have that link?
11:53 <walkerma> http://en.wikipedia.org/wiki/User:Walkerma/Sandbox5#Comments_1
11:53 <ChemSpiderMan> thx
11:54 <walkerma> Actually, I was wondering how to fix that
11:54 <+Physchim62> dmacks, there are eight aldohexoses, each with two enantiomers and two ring forms
11:54 <dmacks> What if we have a chemsubbox, that contains just the isomer-specific or component-specific info for each thing in a less-than-specific chembox?
11:54 <walkerma> Glucose gets about a million hits a year, and we're giving them the name for galactose right now!
11:55 <ChemSpiderMan> gents...we can go into minute detail about the outliers now but I think we have about 95% of the articles without this problem
11:55 <walkerma> dmacks: As ever, you offer a very workable compromise
11:55 <+Physchim62> that was doubtlessly an OrgChem prof who wanted to check if his students were copying from WP!
11:55 <dmacks> *blush*
11:55 <walkerma> Actually, it was somoeone from [affiliation removed for privacy]
11:55 <+itub> but most aldohexoses are not glucose
11:56 <ChemSpiderMan> [name removed for privacy]
11:56 <walkerma> (Please rm that name from the log, though - I wanted to keep IDs of respondents confidential)
11:56 <+Physchim62> so, how are we going to work the validation process, assuming that we are going to have one for many, if not all, of our articles?
11:56 <dmacks> okay
11:57 <ChemSpiderMan> oops..sorry
11:57 -!- Beetstra [n=djbeetst@Wikimedia/Beetstra] has quit ["Bye Bye"]
11:58 * Physchim62 is going to have to disappear in a few minutes or so
11:58 <walkerma> I think the process should be (a) Mostly wait and see if ACS can help
11:59 <walkerma> (b) If they definitely can't we need to work through the lists
11:59 <ChemSpiderMan> I'll put the curation process on hold until they do ;-)
11:59 <+Physchim62> I'm fine with that if other people are
11:59 <walkerma> http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Chemistry/CAS_validation
11:59 <walkerma> We also need to see if we can get CAS_validated into ChemBox new
12:00 <ChemSpiderMan> You need a timeline...if ACS don't answer by XXX then we do YYYY
12:00 <walkerma> How about the end of the month?
12:00 <ChemSpiderMan> My question to them in November resulted in them asking me to meet them in New Orleans in April to discuss
12:00 <+Physchim62> I don't think that CAS validation is the most important thing for the project at the moment, although it is linked with other decisions that we need to take
12:00 <ChemSpiderMan> end of month is GOOD
12:01 <walkerma> Physchim62: Currently it is one of the weakest areas of our chemical substance pages
12:01 <walkerma> We have a lot of errors
12:01 <+Physchim62> bovine excrement!
12:01 <+Physchim62> IMHO
12:02 <walkerma> You mean that the severe lack of non-childish content is our most pressing problem?
12:02 <+Physchim62> it is an area where certain people have chosen to attack us, that is all!
12:02 <ChemSpiderMan> I can deal with the triples of strutcure representations, SMILES, InChIs and names in general. But I cannot deal with CAS in the validation
12:02 <ChemSpiderMan> Have people attacked about CAS?
12:03 <walkerma> Only occasionally
12:03 <+Physchim62> on many occasions!
12:03 <ChemSpiderMan> hmmm....
12:03 <+Physchim62> it's easy to pick up on, beacuse CAS numbers are so widely used
12:04 <+Physchim62> but if all you do is reply to attacks, you're never going to *construct* anything
12:04 <dmacks> Skool-kidz vandalism isn't any "badder" than typos and data taken from an incorrect MSDS though. It's the same validation problem.
12:04 <walkerma> Anyway, Physchim62: Can you, Andrew and Beetstra agree how to handle the generic CAS vs specific CAS in the ChemBox?
12:04 <walkerma> That way, if/when we hear back from CAS/ACS we are ready
12:05 <walkerma> Perhaps I should've said: generic CAS plus specific CAS
12:05 <+Physchim62> OK, as I see it, "specific CAS" must go *directly* below the structure: but that's eminently doable
12:05 <walkerma> Yes
12:06 <walkerma> Or we could use dmacks suggestion
12:06 <+Physchim62> I'll speak to Dirk and Andrew when I get a chance: they will no doubt read the logs or the summary anyway
12:07 <walkerma> One point on that: If I get something wrong/miss something in the summaries, please edit
12:07 <+Physchim62> dmacks suggestion is technically more difficult: it could be implemented for individual "problem cases" but I'm hesitant about making it a general standard
12:08 <walkerma> Fair point
12:08 <dmacks> Yeah, was only intended for cases where it was needed, not as a general-use thing.
12:09 <walkerma> Well, I'd like to draw things to a close. Is there anything else "burning" to discuss today?
12:09 <+itub> I was looking at http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Chemistry/CAS_validation and the links at the bottom ask me for a password
12:09 <ChemSpiderMan> Sorry...that's my fault!
12:10 <ChemSpiderMan> I check from within the domain...
12:10 <ChemSpiderMan> you shouldn't need one
12:10 <ChemSpiderMan> I can send the new path to Martin when resolved or just annotate the page....let me do that
12:10 <ChemSpiderMan> it's a redirect issue
12:11 <ChemSpiderMan> will get on it now
12:11 <walkerma> It's a ChemSpider password. I've had problems - ChemSpider wouldn't let me log on via that system for some reason
12:11 <walkerma> I was able to log on in other ways
12:11 <ChemSpiderMan> bottom line there are now 150 structures waiting for "validation:
12:11 <walkerma> That's why I haven't updated the main validation list
12:11 <ChemSpiderMan> ok...will fix
12:11 <walkerma> (the other list for structures/InChIs etc)
12:11 <ChemSpiderMan> we just shuffled domains and it's the problem
12:13 <walkerma> For next week: Physchim62 has proposed a very elegant way of putting all of our data into a nice PersonData-type database
12:13 <walkerma> Could we put that on the agenda?
12:13 <walkerma> So we can all tear his idea to shredsxxxxxxxxxxx come up with some helpful ideas?
12:14 <+Physchim62> :P
12:14 <ChemSpiderMan> not here next week Walkerma...will be travelling.
12:14 <walkerma> I happen to think it's a brilliant proposal that will help us move into the next level of trustworthiness & reliability
12:14 <+Physchim62> I will try to see what the IUBMB recommend for biochemical nomenclature
12:15 <walkerma> Other remaining issues include: Resolving how we present carbohydrates, ATP-type phosphates, amino acids, etc
12:15 <+Physchim62> "protonation issues"
12:15 <walkerma> Yes
12:16 <dmacks> yup
12:16 <walkerma> I'd like CHemSpiderMan to be around for that discussion - so should we hold off on the protonation issue till he returns?
12:16 <+Physchim62> we obviously need to discuss these, even if we cannot come to a quick and simple conclusion
12:16 <+Physchim62> OK
12:17 <walkerma> So should we discuss the PersonData proposal next week?
12:17 <+Physchim62> I propose "choice and indexing of identifiers"
12:17 <+Physchim62> as the agenda item
12:17 <walkerma> Good one - can you elaborate though
12:18 <+Physchim62> 1. which identifiers are the most important for us (already discussed, I realise)
12:19 <+Physchim62> 2. should we create indexes on these identifiers
12:19 <+Physchim62> 3. under what circumastances should we link out to external sites
12:20 <ChemSpiderMan> Wish I could be there for number 3.
12:20 <+Physchim62> ChemSpiderMan, don't worry, it's a perennial topic!
12:21 <ChemSpiderMan> no surprise!
12:21 <walkerma> Are there any other things we should consider covering next week?
12:21 <walkerma> If not, I'd like to see PC's suggestion as the main topic
12:22 <walkerma> It's perennial, but hopefully we can move things forward a bit
12:24 <+Physchim62> must dash, TTFN
12:24 <walkerma> OK, let's call it a day/night! PC: Can you hang on, I wanted to ask you a quick question about the GoldBook work
12:24 -!- Physchim62 is now known as PC62|away
12:24 <+PC62|away> OK, you just caught me
12:24 <ChemSpiderMan> I am talking to Fabienne Meyers about this...
12:25 <walkerma> I wanted to update the worklist, for things I've worked on, but it's very unclear. Can you put some instructions on wiki about how to do this
12:25 <walkerma> So it's clear where (as with anil) there is NOW a redirect from anil to Schiff base, there wasn't before - I think the work is done, now that the Schiff base article cites GoldBook
12:27 <+PC62|away> we should have a cite to the entry for "anil" as well, surely
12:27 <+PC62|away> anil is a IUPAC accepted term for a certain type of Schiff base
12:27 <walkerma> It's not really notable enough IMHO. Same with Crown (conformation) which is one conformation (not the major one) for eight membered rings
12:27 <+PC62|away> so long as the ref is in there, simply change the template to say Status=Y
12:28 <walkerma> Do I change anything else?
12:28 <+PC62|away> OK, I'll have a think about it. some of these terms really belong on Wiktionary rather than WP
12:28 <+PC62|away> no
12:28 <walkerma> OK, that's it! Thanks
12:29 <ChemSpiderMan> bye
12:29 <+PC62|away> when you think the links are OK, you change the status to Y
12:29 <walkerma> I'll "see" everyone next week? Except Antony..
12:29 <+PC62|away> that way, other's know not to worry about it
12:29 <walkerma> Thanks
12:29 -!- ChemSpiderMan [n=ChemSpid@c-68-33-151-242.hsd1.md.comcast.net] has quit []
12:29 <dmacks> I think I have a seminar, but will try to pop by depending on how long it goes.
12:29 <+PC62|away> connection willing, yes!
12:30 <walkerma> Bye!
12:30 -!- walkerma [n=chatzill@admin-151-108.potsdam.edu] has quit ["ChatZilla 0.9.80 [Firefox 2.0.0.11/2007112718]"]
12:30 <+itub> bye
12:30 -!- itub [n=tubert@lalo.chemie.unibas.ch] has left #wikichem []
--- Log closed Tue Feb 05 12:30:43 EST 2008