Evergreen ILS Website

IRC log for #evergreen, 2016-12-13

| Channels | #evergreen index | Today | | Search | Google Search | Plain-Text | summary | Join Webchat

All times shown according to the server's local time.

Time Nick Message
00:54 StomproJ joined #evergreen
03:04 book` joined #evergreen
03:17 Stompro joined #evergreen
05:01 pinesol_green News from qatests: Test Success <http://testing.evergreen-ils.org/~live>
06:17 csharp dbs++ # closed date fixes
07:00 agoben joined #evergreen
07:13 rjackson_isl joined #evergreen
07:22 JBoyer joined #evergreen
08:08 rlefaive joined #evergreen
08:38 mmorgan joined #evergreen
08:40 collum joined #evergreen
08:54 bos20k joined #evergreen
08:54 Dyrcona joined #evergreen
09:04 krvmga joined #evergreen
09:08 kmlussier joined #evergreen
09:17 yboston joined #evergreen
09:24 mdriscoll joined #evergreen
09:28 dbs csharp++ # signing off! I'll push those in a bit
09:33 pinesol_green [evergreen|Dan Scott] LP#1432753 Restore "All day" verbiage to Closed Dates editor - <http://git.evergreen-ils.org/?p=​Evergreen.git;a=commit;h=8bfa94f>
09:33 pinesol_green [evergreen|Dan Scott] LP#1594937 Fix off-by-one display of closed dates - <http://git.evergreen-ils.org/?p=​Evergreen.git;a=commit;h=233e61f>
09:35 csharp hooray!
09:35 mmorgan dbs++
09:36 mmorgan csharp++
09:42 kmlussier dbs++ csharp++
09:57 Christineb joined #evergreen
10:06 Bmagic dbs++
10:06 Bmagic csharp++
10:41 Jason_ joined #evergreen
10:51 Bmagic I keep having this memcached issue and I can't figure it out. memcached doesn't log anything. Our libraries have to continue to login every step of the way through the staff client. It's like memcached is evicting the key, but memcstat shows 0
10:52 Bmagic I found these in osrfsys.log open-ils.circ [ERR :21628:Cache.pm:126:1481641510398122] Unable to store oils_permit_key_740b499e8fdcd95de95ce0a8d48c0cc0 => [1 bytes]  in memcached server
10:52 Bmagic then I restart memcached, and everything is fine for a few days
11:01 Dyrcona Bmagic: Never seen that before but sounds like a memcached bug to me.
11:01 Stompro Bmagic, how many connections does memcached have open when this happens?  Any chance new connections are being denied?  With our openvz VM's, we sometimes hit the artificial resource limits on tcp buffers and connections, running openvz?
11:01 pastebot "Bmagic" at 64.57.241.14 pasted "Failed state of memcached" (105 lines) at http://paste.evergreen-ils.org/36
11:04 Bmagic Stompro: not running openvz
11:05 Bmagic if it was TCP, then restarting memcached wouldn't help right?
11:05 Stompro Bmagic, have you captured any traffic between the memcache server and your app servers that shows the issue?
11:06 Bmagic no, that would require us to be in a failed state while everyone waited for me to set that up. I want to get it up and running asap. but maybe I could have it capturing all the time....
11:06 berick you can hit resource limits regardless of openvz, of course
11:07 berick memcache has a connection limit setting (defaults to 1024)
11:07 Stompro Bmagic, if the memcached process was holding onto connections, not fully closing them, then it might report X, but linux might show more.
11:07 berick the OS will have its own open connection limits
11:07 Bmagic I thought it was resource limits at first, so I upped the ram from 4gb to 8gb (as you can see from the pasted config) - also changed servers from no swap to swap
11:09 Bmagic not sure about the tcp connections though. the OS will block new tcp connections after the limit is reached? Where is that config?
11:09 Bmagic oh sysctl
11:11 Stompro Bmagic, there was a bug with added content a while back where the connection wasn't being fully closed, so apache was eating up connections... let me find it, the troubleshooting steps might help look for a problem.
11:11 Dyrcona net.core.somaxconn
11:11 Dyrcona That's a per socket max connections.
11:12 Dyrcona I believe it's per socket. Looks like the default is 128. You had 123 in your output.
11:12 Bmagic hmmm... that could be something
11:12 Dyrcona probably another way to set it other than sysctl.
11:13 Dyrcona There are probably additional admin type connections to the socket that aren't counted by memcached.
11:13 Bmagic hmmm
11:14 Bmagic how many connections do you have in production for example?
11:14 Dyrcona We run two memcached servers. I'll check in a minute.
11:14 Stompro Bmagic, Found it, Bug 1478123.
11:14 pinesol_green Launchpad bug 1478123 in Evergreen 2.8 "Missing close() after shutdown() in EGCatloader/Record.pm FD Leak" [Medium,Fix released] https://launchpad.net/bugs/1478123
11:16 Dyrcona Stompro: That shouldn't affect memcached, unless memcached is on the save server and you run out of descriptors to open new sockets with. :)
11:16 Bmagic Stompro: my memcached server is on a separate machine from all of the apache app servers
11:16 Dyrcona s/save/same/
11:16 Bmagic it is interesting though
11:17 Stompro Bmagic, I just meant that looking for incorrectly closed connections might be a troubleshooting step.  Not that this specific bug was a problem.
11:17 Bmagic roger that
11:18 Dyrcona Bmagic: What's the command you ran to get that output? It has been a while since I've looked at memcached statistics.
11:18 * Stompro was just googling the same thing.
11:18 Bmagic memcstat --servers (ip address)
11:19 Bmagic I wish I knew what my connection numbers were on our old setup. I do recall all of the app servers running their own memcached instances (not realizing) which turned out to be in use from eg_vhost.conf. Now, I have edited that config to point to the main server
11:20 Bmagic perhaps with the splintered memcached servers before, I never hit the 128 limit, but now that I have all of the connections talking to the same server.....
11:21 Bmagic in any event, 128 is low, or there is a bug with getting those closed
11:22 Bmagic well, I am looking again, and now I have 133 curr_connections - everything is working fine right now, so, 128 must not be the limit
11:23 Dyrcona Bmagic: We have 529 and 728 current connections on the two servers.
11:23 Bmagic and what is your net.core.somaxconn setting?
11:23 Dyrcona Hm.. It's 128 on the first one.
11:24 Bmagic darn, I was hoping it was that
11:26 Dyrcona That's the core value. There's a way to override it.
11:29 Dyrcona Great... Running into this on jessie: /usr/bin/ld: cannot find -ldbdpgsql
11:31 Dyrcona Crazy, but I'm not finding it. Mabye I'm thinking of BSD systems.
11:33 Dyrcona Bmagic:  not sure if raising net.core.somaxconn will help you or not.
11:34 Dyrcona And, funny thing about the link error, I don't really need cstore compiled on this machine, 'cause I don't plan to run services. I'm just installing for some of the libs.
11:39 Dyrcona Weird. Is this just a problem with 32-bit systems? 'Cause I didn't have this problem on the 64-bit vm.
11:40 brahmina joined #evergreen
11:40 bshum It wouldn't surprise me
11:40 bshum We haven't tested 32-bit installs with Evergreen in a long time...
11:40 bshum At least I haven't
11:40 bshum And there were problems the last times we had anyone try it
11:42 Dyrcona It's right there: /usr/lib/i386-linux-gnu/dbd/libdbdpgsql.so
11:42 Dyrcona Thing is, I don't need this. I just wanted some of the Perl libs that I use in some scripts and they don't depend on that.
11:42 Dyrcona However, we have no flags to disable building the services.
11:44 Bmagic I think I need more information to figure out my issue. I have -vv switch in memcached config but I still get zero log entries in /var/log/memcached.log
11:45 Bmagic maybe that file wasn't writable? I have created it manually, and gave memcace user permissions. But memcached was already started, maybe restart tonight and we'll see
11:46 bshum Dyrcona: https://bugs.launchpad.net/evergreen/+bug/1445182 maybe?
11:46 pinesol_green Launchpad bug 1445182 in Evergreen 2.7 "Debian Jessie 2.8.0 libdbi-drivers cannot find libpq.so" [Undecided,Fix released]
11:46 bshum Ghost of, I mean
11:47 Dyrcona Yeah, no. You're not supposed to install it by hand on Jessie, 'cause there's a package.
11:48 Dyrcona It should just work.
11:48 Dyrcona And, it did on 64-bit.
11:49 bshum Ah
11:49 bshum Right then :)
11:49 Dyrcona So, I'll mangle the makefiles and add the full path to the lib where it should go. That should get me past this.
11:52 Dyrcona Bingo! We have a winner!
11:55 Bmagic What do we have for him Jonny? It's a NEW CAR!
11:56 Dyrcona heh! Now, I hear the music in my head.
12:01 Jason_ Question: I'm installing evergreen on a test server (ubuntu-trusty) and I've encountered an issue with opensrf configuration > Websockets installation. The git clone does not contain the example configuration file "examples/apache_24/websockets/apache2.conf" listed in the instructions. Any ideas? Or should I skip it for now as websockets are optional?
12:01 bshum Jason_: The example config file is in the opensrf source files
12:01 bshum Not in the git clone for websockets
12:02 bshum That instruction is getting updated in a future release to be more clear I think.
12:03 bshum Websockets are "optional" if you're not planning to use the web-based staff client (which is still under development).  So yes, you could technically skip the step.
12:03 Jason_ Got it, and found the file! Thanks :)
12:04 kmlussier Jason_: Dyrcona's blog post on the web client installation is helpful. It identifies a couple of things like that issue that aren't clear in the instructions. http://blog.mvlcstaff.org/2015/06/sett​ing-up-evergreen-browser-staff_2.html
12:05 Jason_ I'll check that out, thanks
12:45 _adb joined #evergreen
12:54 bmills joined #evergreen
13:03 sandbergja I'm running into an issue with authority_control_fields.pl: every time I run it with the -a or --all flag, it gives me the error message "Can't use an undefined value as an ARRAY reference at /openils/bin/authority_control_fields.pl line 77."
13:04 sandbergja Running it with the --start and --end works just fine, but I want to set it up to run regularly, and don't want to have to calculate the --end value each time
13:10 Dyrcona sandbergja: That would seem to indicate that it is either timing out or failing to retrieve all of the non-deleted bibs in your database.
13:12 Dyrcona Or the list of ids returned is too big for ejabberd to handle.
13:12 sandbergja That's helpful, thanks
13:14 sandbergja If it is a timing out issue, would that be in opensrf or postgres or somewhere else?
13:15 Dyrcona OpenSRF. Postgres doesn't time out.
13:15 sandbergja Got it.
13:17 Dyrcona If you specify the days_back option, it connects to the database to get the list of ids.
13:17 Dyrcona It should probably do that for the all option as well.
13:20 sandbergja Oooh, let me try that out!
13:21 bmills joined #evergreen
13:21 Dyrcona By "it should," I mean it doesn't now, but should be changed to do so. Are you going to take a crack at changing the code?
13:23 sandbergja I think that'd be a little advanced for me, but I think that the --days_back flag is actually what we need, rather than --all
13:24 Dyrcona OK. I ususally use days_back, like 90 or so for authority updates.
13:24 sandbergja I might throw together some documentation, because it doesn't look like the --days_back flag is mentioned in the official docs or the help screen on authority_control_fields.pl -h
13:26 sandbergja Thanks so much for your help!
13:26 sandbergja Dyrcona++
13:26 mmorgan joined #evergreen
13:29 Dyrcona Looks like the days_back option was added in 2012 and the documentation for the help was added in 2011.
13:29 Dyrcona Two potential Lp bugs, there. :)
13:30 sandbergja Fun fun!
13:31 sandbergja I can report those and work on the documentation aspects.
13:33 Dyrcona Cool. I'll see about fixing the all option.
13:35 sandbergja Thanks!  It seems like that would be very helpful for us to run with the --all option eventually (e.g. 2 years from now, when we get an authority record that needs to be linked to a bib record we haven't touched in forever)
15:04 mmorgan1 joined #evergreen
15:08 Dyrcona joined #evergreen
15:17 JBoyer Bmagic, I ran into the same problem you're seeing when we increased the number of app servers and the number of spare children at the same time. memcached has a connection limit specified in it's config file and the default is 1024. Here I'm using 8192 or thereabouts. It's the "# -c 1024" line in the memcached.conf file you pasted, uncomment it and bump it up and you should stop seeing that happen.
15:18 Bmagic JBoyer: I can't believe we are at 1024, the output of the running config of memcached only showed curr_connections: 123
15:19 JBoyer Oops, missed that part of the paste. Carry on!
15:19 Bmagic perhaps the number of connections with regaurds to that setting is not output from memcstat?
15:20 JBoyer I would trust memcstat over netstat, since memcached will likely not "count" connections in CLOSE_WAIT or other non-connected statues.
15:29 Dyrcona Bmagic: I don't think the defaults are output, and the config file could actually be different from the options set when memcached was built.
15:30 Dyrcona I'd try just uncommenting that line to see and restarting memcached to see what happens.
15:30 Bmagic I sure will
15:30 Dyrcona oops. bad on the fly editing.
15:30 Bmagic 1024 seems fine
15:30 Dyrcona I'll check out configuration.
15:30 Bmagic of course, I have to wait till 4am
15:32 Dyrcona Ours is still commented, but we're on a different distro from what I think you're running.
15:37 Dyrcona Bmagic: You're running this in a docker appliance, yeah?
15:37 Bmagic memcached is running on a plain ol vm
15:38 Bmagic the docker stuff is app bricks only
15:38 Dyrcona OK. Thanks. Thought maybe some docker setting could apply here, but I don't know docker, so...
15:38 Bmagic I don't think that is in the equation
15:39 Dyrcona Have you had another look with memcstat and has anything changed much?
15:40 Bmagic curr_connections: 129         total_connections: 5180         connection_structures: 155
15:40 Bmagic not really
15:41 bshum That's a bit higher total
15:42 bshum But probably not insane
15:42 * bshum keeps musing to himself
15:43 Bmagic yeah, I agree, and because the system is running fine right now....
16:39 mmorgan joined #evergreen
17:01 pinesol_green News from qatests: Test Success <http://testing.evergreen-ils.org/~live>
17:07 mmorgan left #evergreen
18:35 bmills joined #evergreen
18:56 bmills joined #evergreen
19:44 StomproJ joined #evergreen
21:36 bmills joined #evergreen
22:42 littleDude joined #evergreen

| Channels | #evergreen index | Today | | Search | Google Search | Plain-Text | summary | Join Webchat