#jruby on 2021-03-11 — irc logs at freenode.irclog.whitequark.org

2021-02-24 19:20 ChanServ changed the topic of #jruby to: Get 9.2.15.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

00:10 ur5us has joined #jruby

00:42 <headius[m]> finally have a HEAD build working with rubocop-ast

00:52 travis-ci has joined #jruby

00:52 <travis-ci> jruby/jruby (fix_dist_gems:d66107b by Charles Oliver Nutter): The build has errored. https://travis-ci.com/jruby/jruby/builds/219691581 [22 min 5 sec]

00:52 travis-ci has left #jruby [#jruby]

01:07 <travis-ci> jruby/jruby (fix_dist_gems:074e096 by Charles Oliver Nutter): The build is still failing. https://travis-ci.com/jruby/jruby/builds/219691727 [166 min 44 sec]

01:07 travis-ci has joined #jruby

01:07 travis-ci has left #jruby [#jruby]

01:37 fzakaria1 has joined #jruby

01:37 <fzakaria1> headius: did something change about JDK modularization in JDK 9.2.16.0

01:37 <fzakaria1> Tracking down a new `Java::JavaLang::NoClassDefFoundError: java/sql/Date` which seems like the culrprit

01:41 <headius[m]> fzakaria: hmm I don't think so

01:42 <headius[m]> JRuby itself is still just using automatic module name and I don't think anything changed in how we start up

01:45 <headius[m]> fzakaria: I scanned a diff and didn't see anything obvious

01:46 <headius[m]> can you confirm that 9.2.14.0 did not do the same thing?

01:47 <fzakaria1> i have a small repro. let me share here.

01:47 <fzakaria1> very interesting.

01:47 <fzakaria1> It's with RubyTime

01:48 * fzakaria1 < https://matrix.org/_matrix/media/r0/download/matrix.org/NbiqwDFZkOoyXPdrhEBrFTfy/message.txt >

01:49 <fzakaria1> Want me to file a bug? A bit of a headscratcher for me right now. Clearly `java/sql/Date` exists, since in an earlier statement you can see I make one :)

01:54 <fzakaria1> Looks irrespective of version

01:58 <fzakaria1> https://github.com/jruby/jruby/issues/6608

03:44 Antiarc_ has quit [Ping timeout: 240 seconds]

04:13 Antiarc has joined #jruby

04:15 ur5us has quit [Ping timeout: 264 seconds]

04:17 Antiarc has quit [Ping timeout: 264 seconds]

04:23 Antiarc has joined #jruby

04:32 Antiarc_ has joined #jruby

04:34 Antiarc has quit [Ping timeout: 264 seconds]

06:10 <headius[m]> fzakaria I am betting it is a module activation thing. May be able to activate it from the source, I need to look at the API

06:11 <headius[m]> Ironically, once you opt into modularity you are forced to be explicit about the modules you access

06:12 <headius[m]> This may be something we add some Ruby support for, like `Java.add_module "java.sql"`

06:29 nirvdrum has quit [Ping timeout: 272 seconds]

07:58 drbobbeaty has quit [Read error: Connection reset by peer]

07:59 drbobbeaty has joined #jruby

13:05 vext01 has joined #jruby

13:05 quadz_ has joined #jruby

13:06 Iambchop_ has joined #jruby

13:07 Iambchop has quit [Ping timeout: 256 seconds]

13:07 sagax has quit [Ping timeout: 256 seconds]

13:07 ebarrett has quit [Ping timeout: 256 seconds]

13:07 quadz has quit [Ping timeout: 256 seconds]

13:07 Iambchop_ is now known as Iambchop

14:13 nirvdrum has joined #jruby

16:09 travis-ci has joined #jruby

16:09 travis-ci has left #jruby [#jruby]

16:09 <travis-ci> jruby/jruby (fix_dist_gems:2251ad7 by Charles Oliver Nutter): The build is still failing. https://travis-ci.com/jruby/jruby/builds/219769258 [81 min 32 sec]

16:23 <headius[m]> e

16:23 <headius[m]> enebo: I remove the permgen flag from release doc

16:24 <headius[m]> I assume you weren't doing that anymore

16:24 <enebo[m]> headius: I saw that and yeah I stopped using it

16:27 <headius[m]> I have this build rework almost done... the one failure made me check how CRuby does this (it does the installs during `make install`) and we'll match after I finish this

16:30 travis-ci has joined #jruby

16:30 <travis-ci> jruby/jruby (fix_dist_gems:95ea3c7 by Charles Oliver Nutter): The build was fixed. https://travis-ci.com/jruby/jruby/builds/219771075 [166 min 1 sec]

16:30 travis-ci has left #jruby [#jruby]

16:31 <headius[m]> tada

16:31 <headius[m]> 🕴

16:31 <headius[m]> is that supposed to be a magician?

16:31 <headius[m]> 🎩

16:33 <headius[m]> enebo: at some point we need to clean break and do a complete reformat and cleanup of the pom.rb files

16:33 <headius[m]> I have been reluctant to do anything drastic since we want to be able to merge

16:33 <enebo[m]> yeah

16:34 <enebo[m]> I have wished there was "hinting" in merging for particular files

16:34 <enebo[m]> like when you merge a pom.xml file it seems like git could have some extra info somehow

16:34 <enebo[m]> but I guess if you PR a branch to change a pom then you would not want it

16:34 <headius[m]> wiping out the poms on master would be nice too, since that will be one less thing to ignore when merging

16:37 <headius[m]> all commits should be in for that dist gem thing now... I am doing another release deploy to get a final diff from 9.2.16.0 to 9.2.17.0

16:38 <enebo[m]> cool

16:49 <headius[m]> enebo: you already check out a clean repo before releasing but that is a requirement now... in order to have all bin files included without expanding this whitelist it just includes everything

16:50 <headius[m]> your mvn deploy should run against a clean clone... installing rspec or whatever after that finishes (so you can run that rake task) is fine

16:50 <headius[m]> if that makes you nervous maybe you can help me figure out a way to keep it clean without easily-outdated whitelists

16:51 <enebo[m]> oh hmm after running deploy I do run rake post_process_artifacts

16:51 <enebo[m]> Which should not require rspec

16:52 <enebo[m]> but I have not run against a dirty workspace in like a decade

16:52 <enebo[m]> I think when we had an ant build it did not matter at all

16:58 travis-ci has joined #jruby

16:58 <travis-ci> jruby/jruby (fix_dist_gems:e79285d by Charles Oliver Nutter): The build was broken. https://travis-ci.com/jruby/jruby/builds/219773909 [163 min 38 sec]

16:58 travis-ci has left #jruby [#jruby]

17:23 <headius[m]> ugh

17:23 <headius[m]> enebo: rake requires rspec because we load those tasks unconditionally

17:30 <headius[m]> good grief

17:30 <headius[m]> https://bugs.openjdk.java.net/browse/JDK-8263432

17:31 <headius[m]> what is it about our project that we hit at least one OpenJDK bug a month

17:36 <headius[m]> ok one last verification of this gem stuff locally because I botched a path

17:37 <headius[m]> lesson today is don't make changes after you verify things are working

17:54 <headius[m]> there may still be a bundler issue

17:54 <headius[m]> I have the files in the right place now, but bundler is looking in the wrong place

17:55 travis-ci has joined #jruby

17:55 <travis-ci> jruby/jruby (fix_dist_gems:bc1f1b6 by Charles Oliver Nutter): The build is still failing. https://travis-ci.com/jruby/jruby/builds/219780661 [163 min 56 sec]

17:55 travis-ci has left #jruby [#jruby]

18:02 <enebo[m]> headius: the rspec thing should be fine I will just gem install it after release but before post_processing

18:02 <headius[m]> yeah that will do it

18:02 <enebo[m]> the post processing works on jruby-dist

18:03 <enebo[m]> now I should check to see if we whitelist in the windows installer :)

18:03 <headius[m]> I want to get that rspec req gone because it isn't needed for most tasks

18:03 <headius[m]> soft req anyway

18:03 <enebo[m]> I feel like rake encourages loading the world

18:03 <headius[m]> in other news... it would be great if we could finally rip the ant stuff out into the jruby-ant gem

18:04 <headius[m]> we still have a dependency on ant to compile JRuby proper and exactly one project uses that feature (us)

18:04 <enebo[m]> yeah I was going to say I think we use it for something internally still

18:04 <headius[m]> or perhaps spin it into a maven artifact and the gem pulls that down

18:04 <enebo[m]> but it feels strange we cannot lean on java-compiler-plugin or something

18:05 <headius[m]> we use it for the stupidest thing: launching subprocesses

18:05 <headius[m]> I am pretty sure the only thing we use in our build is ant.exec

18:05 <enebo[m]> I think externally people (or perhaps me) use it for calling javac

18:05 <enebo[m]> Although I noticed racc does not leverage it. It appears to raw call javac

18:05 <headius[m]> ahh perhaps that too

18:06 <headius[m]> but it is doable other ways

18:06 <headius[m]> like getting those sources properly set up for maven to build them

18:06 <enebo[m]> Actually it uses javaextensionstask

18:06 <enebo[m]> which is greatr

18:06 <headius[m]> oh we must have fixed that at some point

18:07 <headius[m]> well just a few more steps to eliminate it

18:07 <enebo[m]> now what does javaextensionstask do :)

18:09 <enebo[m]> ok that is fine too

18:09 <enebo[m]> it does an sh on javac

18:24 <headius[m]> yeah, fancy

18:34 <fzakaria1> oh boi ant

18:35 <fzakaria1> that's an artifact of the times :)

18:40 <headius[m]> enebo: https://github.com/jruby/jruby/pull/6609#issuecomment-796956127

18:40 <headius[m]> this is done but one change introduced by including all bin/* stuff is that the bin/ruby symlink to bin/jruby is now included

18:41 <headius[m]> it was explicitly excluded before... and this should be ok on Windows since "ruby" as a filename does not look like an executable there, but I wanted to get your input

18:41 <headius[m]> will request a review

18:52 <enebo[m]> headius: I just read through it

18:53 <enebo[m]> My only ask would be what is different from a ls/listing from 9.2.16.0 and also that the shebangs are agnostic to who is building

18:53 <enebo[m]> I assume the later is true so I am just saying it because it came to mind when I read that change

18:54 <headius[m]> the change to always do env shebangs should avoid the latter from now on

18:54 <headius[m]> I have also proposed to RG that they just make env shebang the default and so far they agree

18:54 <headius[m]> but that is future work

18:54 <headius[m]> I did provide a diff listing in my last comment

18:54 <headius[m]> and explained the differences

18:56 <enebo[m]> ah I missed that although I remember reading the top

18:56 <enebo[m]> so out of that then bin/ruby is the only ? in my head really

18:57 travis-ci has joined #jruby

18:57 travis-ci has left #jruby [#jruby]

18:57 <travis-ci> jruby/jruby (fix_dist_gems:293975c by Charles Oliver Nutter): The build was fixed. https://travis-ci.com/jruby/jruby/builds/219790846 [166 min 2 sec]

18:59 <enebo[m]> "This may need testing in the zip file, since the symlink will obviously not work on Windows."

18:59 <enebo[m]> I believe this won't work anyways since it is not an executable or bat file

18:59 <headius[m]> yeah that occurred to me later

19:00 <headius[m]> I do not know why it was excluded

19:00 <enebo[m]> ruby?

19:00 <headius[m]> yeah

19:00 <headius[m]> I do not know how much of a risk it is for it to be in the dist now

19:00 <enebo[m]> I don't either but I wonder if it was someone who wanted ruby to be whatever c ruby they were using

19:00 <headius[m]> yeah but we know that doesn't work anyway

19:01 <enebo[m]> I guess for 9.2 that one is iffy to me but for 9.3 I am gung ho for it

19:01 <headius[m]> so it may be an old requirement from back when we still believed you could have two Rubies in path

19:01 <headius[m]> I could explicitly exclude it for 9.2

19:01 <enebo[m]> I just think it would be removing a variable

19:02 <enebo[m]> I am pretty curious to see if it causes problems but I want to be as done with 9.2 as we can be

19:02 <enebo[m]> not that we won't still fix issues for a while

19:02 <headius[m]> yeah better safe than sorry

19:05 <enebo[m]> headius: do you know of a good writeup on how this works: RubyClass stringClass = runtime.defineClass("String", runtime.getObject(), RubyString::newAllocatedString);

19:05 <headius[m]> lol writup

19:06 <enebo[m]> A static method which matches the interface methods definition

19:06 <headius[m]> oh you mean the method reference?

19:06 <enebo[m]> An explanation is fine but is this that obscure

19:06 <headius[m]> yeah that is all it is

19:06 <headius[m]> this is not really obscure and came along with lambdas in 8

19:06 <enebo[m]> but how does it implement it

19:06 <headius[m]> Method References

19:07 <enebo[m]> It has to resolve to a type and then it makes MHs to hook it up?

19:07 <headius[m]> it is mostly shorthand for (r, c) -> RubyString.newAllocatedString(r, c)

19:07 <headius[m]> but it may route the resulting interface impl directly to the target method more efficiently than a little class, I am not sure

19:07 <enebo[m]> ok so it will generate some handles which internally will replace the need for a type

19:07 <headius[m]> yeah

19:07 <headius[m]> it will be an invokedynamic to build a tiny interface impl that just calls that method

19:07 <headius[m]> and from then on just use the generated code

19:08 <enebo[m]> I brought up a year ago that I should change the parser to use lambdas as an experiment

19:08 <enebo[m]> So I have been thinking about that

19:08 <lopex> it's also called eta reduction

19:09 <enebo[m]> I am a bit worried about initial warmup but I think past experience tells me I need to just try it

19:10 <enebo[m]> I think the neat idea of this is I can hoist some variables and not pass them (although that may not be the win I think it is)

19:11 <headius[m]> warmup is a valid concern

19:11 <headius[m]> well, startup

19:11 <headius[m]> but consider this: it had to load the .class before anyway

19:11 <enebo[m]> yeah startup but mostly how bad cold perf is

19:12 <headius[m]> so unknown how much more or less overhead it is replacing an inner class with one of these

19:12 <enebo[m]> yeah :)

19:12 sagax has joined #jruby

19:12 <enebo[m]> It is actually pretty simple to try and it would be removing hundreds of types

19:14 <headius[m]> 7: invokedynamic #11, 0 // InvokeDynamic #0:allocate:()Lorg/jruby/runtime/ObjectAllocator;

19:14 <headius[m]> 12: invokevirtual #15 // Method org/jruby/Ruby.defineClass:(Ljava/lang/String;Lorg/jruby/RubyClass;Lorg/jruby/runtime/ObjectAllocator;)Lorg/jruby/RubyClass;

19:14 <enebo[m]> 599 classes

19:14 <enebo[m]> which will be like 800 or more with 3.0 grammar

19:15 * headius[m] < https://matrix.org/_matrix/media/r0/download/matrix.org/LQWaNreOkwLVPAUNXUvkrcHa/message.txt >

19:16 <enebo[m]> I remember when I made this megamorphic change it was only to reduce main method in LALR to compile to native and was really surprised how quick the PIC was

19:16 <enebo[m]> So I never rule out what will make a positive difference to the parser

19:17 <enebo[m]> Probably the main flaw of the parser now is the AST is about 600,000,000 objects

19:18 <enebo[m]> I am exaggerating but some slab allocated semi protobuf-like allocation would be better

19:18 <headius[m]> yeah so optimization wise this should be no worse and probably better

19:18 <headius[m]> it still has to generate a type to implement the ObjectAllocator interface but it will be tiny and loaded anonymously

19:19 <headius[m]> so loading that is probably less overhead than loading our inner class, but generating it may make the difference back

19:19 <enebo[m]> hey since I brought it up I will just try it. I am a little too into pondering this kwargs rest fix and it should be reasonably easy to change this

19:19 <enebo[m]> but if it is faster it may make it up again

19:20 <headius[m]> FWIW I believe one thing the CDS stuff is supposed to do over time is eagerly generate and link in these statically-resolvable lambdas

19:20 <enebo[m]> like gem list is zillions of evals (or it is in my workspace)

19:20 <headius[m]> "supposed to do"

19:20 <headius[m]> I am not sure if or when

19:20 <enebo[m]> ah but if it is ok now then it potentially will be better later

19:20 <headius[m]> I do think overall this is a win because we ship fewer classes and methods to verify

19:21 <headius[m]> and we can't avoid verification on 9+ anymore, so...

19:21 <enebo[m]> I have not recently timed later vs older for something really cold like gem list

19:21 <enebo[m]> I also mostly just see default GC in the way until I remember to use parallel

19:21 <headius[m]> yeah this would be minute

19:22 <headius[m]> reduced total classes by around 100, so there will be 100 or fewer of these additional now

19:22 <enebo[m]> yeah so if this works we will reduce types by 700

19:22 <headius[m]> and if we AOT with GraalVM this compiles fine anyway 😀

19:22 <headius[m]> oh I see what you are saying

19:22 <headius[m]> yeah that would be hot

19:23 <enebo[m]> I am including your reduction in that but for 3.0 it will reduce probably by 1000

19:23 <headius[m]> that would be a good test because those loads are pure overhead at startup

19:23 <enebo[m]> I am sure we will add at least 300 types for just the pattern matching feature

19:23 <enebo[m]> It replicates quite a bit of the grammar

19:24 <headius[m]> oh hey there is another possible thing we might be able to improve: generating those giant byte arrays

19:24 <enebo[m]> oh per source file?

19:24 <headius[m]> if you can twiddle it to generate them as a string we can just getBytes rather than emitting all those instructions to load it

19:24 <headius[m]> I mean the parser productions

19:25 <headius[m]> if this were bytecode it would be trivial... just stuff the bytes into a char and put that in constant pool

19:25 <enebo[m]> I still don't know what you mean

19:26 <headius[m]> the stuff you have to split in your post-processing

19:26 <headius[m]> the static inits that are too big

19:26 <headius[m]> if the numeric sequence were in a string in constant pool there would be no need for that

19:26 <enebo[m]> oh so you mean instead of a short[] which reassembles just make a string and the unpack

19:26 <headius[m]> right

19:27 <headius[m]> sorry I just started thinking about other things that could be simplified in there

19:28 <enebo[m]> ok so I need to make getbytes[] push two bytes into a short value

19:28 <enebo[m]> which is trivial if I can assume layout of a short and do something unsafe

19:29 <enebo[m]> It is amusing as well I break it into 4

19:29 <headius[m]> enebo: I will merge default gem PR now to 9.2 and master

19:29 <enebo[m]> those four will fit into a single long[]

19:30 <headius[m]> if I can get a head build snapshot to deploy then marcandre should be able to confirm it by restarting a rubocop-ast build

19:30 <enebo[m]> yeah

19:31 <headius[m]> enebo: oh dunno if you saw but I was able to wipe out bin/rake, ri, and rdoc too

19:31 <headius[m]> so switching branches should not mess up rake bin anymore... it is ignored and not versioned on either branch now

19:31 <enebo[m]> nice

19:49 travis-ci has joined #jruby

19:49 <travis-ci> jruby/jruby (jruby-9.2:d9f3585 by Charles Oliver Nutter): The build was broken. https://travis-ci.com/jruby/jruby/builds/219796581 [178 min 4 sec]

19:49 travis-ci has left #jruby [#jruby]

20:04 <headius[m]> of course it was

20:13 <lopex> the broke was built

20:13 <headius[m]> ah that same bad concurrent-ruby spec I have reported and excluded on master

20:14 <headius[m]> oh well

20:29 <enebo[m]> well trivial change and parser is generated with lambdas

20:29 <enebo[m]> I don't perceive any significant difference

20:29 <headius[m]> that's fire, fam

20:29 <headius[m]> how many fewer classes?

20:30 <enebo[m]> I may dust off BenchParser to see if any long running change occurs

20:30 <enebo[m]> 599 with 9.3 but it will be quite a few more for 9.4

20:30 <enebo[m]> So like 10% reduction

20:30 <enebo[m]> I think we make like 6k types in a simple Rails app

20:31 <enebo[m]> It may be less but why quibble

20:31 ur5us has joined #jruby

20:31 <enebo[m]> 8) jdk-15+36 < CURRENT

20:32 <enebo[m]> For newer JVM I used this version which is not latest of even 15 but I have noticed this is has a startup hit over 8 even using parallel gc

20:32 <enebo[m]> It is not huge but I sort of like things to go the other way

20:33 <headius[m]> wow

20:33 <enebo[m]> Parser is a lot of types

20:33 <headius[m]> startup hit on 9+ is due to verification mostly

20:33 <enebo[m]> ripper is another 600 now that I think about it :)

20:33 <enebo[m]> oh right

20:34 <enebo[m]> HAHA

20:34 <headius[m]> we can't use boot classpath so JRuby goes in module path

20:34 <enebo[m]> darkmatter for 9+ since I will eliminate verification on 1200 classes

20:34 <headius[m]> gods maybe our jar will get back down to a reasonable size

20:34 <enebo[m]> hah I did not even check that

20:34 <headius[m]> and the ripper classes will not even be needed unless someone uses ripper

20:34 <headius[m]> and the productions not hit won't materialize

20:35 <enebo[m]> Have we dusted off the old idea if just JITTing an entire type to one class

20:38 <enebo[m]> wow when did jruby.jar get over 15MB

20:39 <headius[m]> 2010-2021

20:39 <enebo[m]> HAHA

20:44 <lopex> headius[m]: enebo[m] https://www.youtube.com/watch?v=krB0enBeSiE&t=4788s

20:46 <enebo[m]> BEFORE: 15340 -rw-rw-r--. 1 enebo enebo 15707591 Mar 11 14:39 lib/jruby.jar

20:46 <enebo[m]> AFTER: 14612 -rw-rw-r--. 1 enebo enebo 14962387 Mar 11 14:44 lib/jruby.jar

20:46 <headius[m]> so over 700k

20:46 <headius[m]> that's solid

20:46 <lopex> NUMBERS

20:46 slonopotamus[m] has joined #jruby

20:46 <headius[m]> slonopotamus: hello there

20:47 <rdubya[m]> 👏

20:48 <slonopotamus[m]> @headius yay! I am a bit surprised that you decided to handle zlib issue I've reported so quickly so I thought it would be more convenient if you could poke me in more realtime fashion here)

20:48 <headius[m]> yeah welcome

20:48 <slonopotamus[m]> looks like my @ powers are too weak :D

20:49 <headius[m]> I did not mean to imply the gz is definitely bad, we just have never seen this reported so I was curious

20:49 <headius[m]> name<tab> should complete in Element client

20:49 <slonopotamus[m]> I remember that I one was already here. There was a story about spawning subprocesses on JRuby + Windows...

20:49 <headius[m]> the mobile client uses @... they need to decide on one way 🙄

20:49 <slonopotamus[m]> * I remember that I ocne was already here. There was a story about spawning subprocesses on JRuby + Windows...

20:49 <slonopotamus[m]> * I remember that I once was already here. There was a story about spawning subprocesses on JRuby + Windows...

20:50 <lopex> and how much could we win on transcoding tables ?

20:50 <headius[m]> lopex: do they have a lot of inner classes?

20:50 <lopex> headius[m]: no, the tables themselves

20:50 <headius[m]> oh like the constant pool trick I was talking about

20:50 <headius[m]> yeah could be big and help startup too

20:50 <lopex> the bigger offenders are probalby not user

20:51 <headius[m]> just yank the table out of a string in constant pool

20:51 <lopex> is there such a thing like order in zip ?

20:51 <lopex> for files

20:51 <slonopotamus[m]> yes

20:51 <headius[m]> slonopotamus: hah windows and processes, great stuff

20:51 <slonopotamus[m]> * lopex: yes

20:51 <headius[m]> and by great I mean super frustrating to support

20:51 <lopex> headius[m]: you mean a binary ?

20:51 <lopex> slonopotamus[m]: can we affect that ?

20:52 <slonopotamus[m]> lopex: when you create a zip, you add entries to it one by one. and they end up exactly in that order. you can even decide *per file* whether it will be compressed or not.

20:52 <lopex> ooh

20:52 <slonopotamus[m]> * lopex: when you create a zip, you add entries to it one by one. and they end up exactly in that order in the file. you can even decide _per file_ whether it will be compressed or not.

20:52 <lopex> so we might tell maven to do that in some order

20:52 <headius[m]> FWIW this is the code that adds those bytes: https://github.com/jruby/jruby/blob/cf4d39c354995a6c67fb86334b2cefe350d5938c/core/src/main/java/org/jruby/ext/zlib/JZlibInflate.java#L318-L327

20:53 <lopex> slonopotamus[m]: cool

20:53 <headius[m]> there is data left on the input buffer and according to this we are supposed to tack them on

20:53 <headius[m]> but clearly that is not the whole story

20:54 <slonopotamus[m]> comment on that code says that it mimics what MRI does. "but clearly that is not the whole story" :D

20:54 <lopex> tables/Transcoder_Big5_WordArray.bin is 400kb

20:54 <lopex> compressed to 230

20:54 <lopex> to not an empty air

20:55 <lopex> and we ship the whole thing every time

20:55 <headius[m]> try zopfli

20:55 <headius[m]> it didn't have a big impact on JRuby but it might do better on that data

20:56 <lopex> via maven ?

20:56 <headius[m]> there is a maven plugin

20:56 <headius[m]> but you could just try zopfli on the file directly, or recompress the jar

20:56 <lopex> or force insertion order like slonopotamus[m] said

20:57 <headius[m]> yeah there are options

20:57 <headius[m]> bzip it and add a dependency to jcodings 🤪

20:57 <lopex> yeah, already considered

20:57 <slonopotamus[m]> lopex: what profits do you expect to have by reordering files? AFAIK, each file inside zip is compressed independently.

20:57 <lopex> no idea

20:58 <slonopotamus[m]> so, you can get better read patterns by placing files that are used together near each other. but you won't win on total size that way.

20:58 <slonopotamus[m]> * so, you can get better read patterns by placing files that are used together near each other within zip archive. but you won't win on total size that way.

20:58 <lopex[m]> I thought reading from jar might have an impact regarding file order

20:58 <slonopotamus[m]> * so, you can get better read patterns by placing files that are used together near each other within zip archive. but you won't win on total archive size that way.

20:59 <headius[m]> https://github.com/ruby/ruby/blob/b44f7151c71011460877bdba549453aaeada88fe/ext/zlib/zlib.c#L1123-L1125

20:59 <headius[m]> so that is where CRuby appends the extra bytes

20:59 <slonopotamus[m]> Is "zstream_append_input" a trivial "append bytes" or it does some nasty things?

21:00 <slonopotamus[m]> also, these bytes could be swallowed somewhere else.

21:00 <lopex[m]> for example Transcoder_Big5_WordArray.bin is probably never user

21:00 <lopex[m]> *used

21:00 <headius[m]> rb_str_buf_cat(z->input, (const char*)src, len);

21:00 <headius[m]> pretty much just appending

21:01 <headius[m]> lopex: given the ordering and dict issues perhaps we should compress all of them together as one gz blob with a header

21:01 <lopex[m]> hmm big5 and gbk take like 700Kb uncompressed

21:01 <headius[m]> slonopotamus: yeah I may have to step through the C code to see why they don't have the extra bytes at this point

21:02 <lopex[m]> a header ?

21:02 <headius[m]> to indicate how big each transcode chunk is

21:02 <headius[m]> in the uncompressed aggregate

21:02 <lopex[m]> ah

21:02 <headius[m]> are we at least lazily loading them?

21:02 <lopex[m]> yeah

21:03 <lopex[m]> everythin in jcodings is lazy

21:04 <lopex[m]> including impl classes

21:04 <slonopotamus[m]> headius: have you tried minifying test data? like, a zero-sized data that was zlib'ed and got a one or two \0 bytes appended? I actually know almost nothing about zlib format and why it is OK to passthrough trailing bytes instead of breaking with "OMG, strange bytes after end-of-stream!" :D

21:05 <headius[m]> yeah I don't understand this logic either but reading around it they may be resetting input to zero elsewhere

21:05 <headius[m]> input data shouldn't matter, I would expect us to do this for any gz data with trailing junk

21:06 <headius[m]> given this logic

21:07 <slonopotamus[m]> (the worst thing that can currently happen is if it turns out that JRuby implementation is correct and all other are wrong :D I one stumbled upon a bug with multiple addr2line implementations [a program that translates program address into a function/file/line info within executable file] where 4 out of 6 implementations were failing to pass my testcase properly)

21:08 <slonopotamus[m]> * (the worst thing that could currently happen is if it turns out that JRuby implementation is correct and all other are wrong :D I one stumbled upon a bug with multiple addr2line implementations [a program that translates program address into a function/file/line info within executable file] where 4 out of 6 implementations were failing to pass my testcase properly)

21:08 <headius[m]> a workaround for you would be to just call the JDK classes

21:08 <headius[m]> it will work fine

21:21 <headius[m]> ok whoever ported this logic misinterpreted it

21:21 <headius[m]> it appends the remaining bytes to the input string so at the end you have the input string containing only those extra bytes

21:21 <headius[m]> it does not append to the output strean

21:21 <headius[m]> stream

21:24 <slonopotamus[m]> ouch. you're right, though I also misread it

21:24 <slonopotamus[m]> they append to next_in

21:24 <headius[m]> I am not sure we are handling the input buffer correctly here but I can flip this to append there and see what we have

21:25 <headius[m]> well they append to z->input from next_in

21:25 <headius[m]> if it is a coming from a stream they allocate a string for z->input to hold it

21:25 <headius[m]> bit definitely not going to output

21:27 <slonopotamus[m]> no, wait. zstream_append_input appends data from second arg into first arg.

21:27 <slonopotamus[m]> * no, wait. `zstream_append_input` appends data from second arg into first arg.

21:28 <slonopotamus[m]> so... they append `z->stream.next_in` into `z`.

21:28 <headius[m]> well into z->input

21:28 <slonopotamus[m]> okay, you;re right

21:28 <headius[m]> constructing that if necessary

21:28 <slonopotamus[m]> * okay, you're right

21:28 <headius[m]> so someone assumed this was going to out stream

21:29 <headius[m]> I will try a quick patch and then see when this was added

21:29 <headius[m]> $ jruby blah.rb

21:29 <headius[m]> 60992

21:29 <headius[m]> 👍

21:29 <slonopotamus[m]> yay!

21:30 <slonopotamus[m]> I wonder if there are any multi-stream tests. I mean, when you use zlib to uncompress things that go one after one.

21:30 <slonopotamus[m]> It looks like that such scenario is supposed to be supported (otherwise, why `next_in` at all)

21:31 <slonopotamus[m]> * I wonder if there are any multi-stream tests. I mean, when you use zlib to uncompress things that go one after one in the same byte flow.

21:31 <headius[m]> we pass and fail the exact same number of tests from MRI test_zlib

21:31 <headius[m]> oh well

21:31 <headius[m]> at least this seems like the right direction... I will push a PR and do a little more poking around

21:32 <headius[m]> they do have some chunked stream tests I think

21:32 <slonopotamus[m]> oh, "chunked". didn't have a proper word for this)

21:32 <slonopotamus[m]> * oh, "chunked". didn't have a proper word for this in my vocabulary)

21:32 * headius[m] < https://matrix.org/_matrix/media/r0/download/matrix.org/LzigdHjPvyCeckEqnCOrJPtk/message.txt >

21:32 <headius[m]> work continues

21:33 <headius[m]> this logic probably needs a re-port... it was contributed a long time ago and chunked support was probably added to CRuby after that

21:38 <slonopotamus[m]> btw, channel topic says 9.2.15.0 while there's 9.2.16.0 out already

21:40 <headius[m]> https://github.com/jruby/jruby/pull/6612

21:40 <headius[m]> oops enebo topic

21:40 <headius[m]> I will fix

21:40 <enebo[m]> hmm weird I thought I did that

21:41 enebo has joined #jruby

21:41 <headius[m]> slonopotamus: to keep risk low I will probably not attempt any larger scale re-port for 9.2.17.0, but I will file an issue to align behavior in 9.3 (which may or may not happen then without help)

21:41 <enebo[m]> ok it is also wrong in irc.fixing

21:41 <headius[m]> this should at least solve your issue

21:42 ChanServ changed the topic of #jruby to: Get 9.2.16.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

21:42 <enebo> exit

21:42 <enebo> lol

21:42 enebo has left #jruby [#jruby]

21:45 <slonopotamus[m]> headius: looking at your code... is buffer you're appending to guaranteed to have enough capacity or it might need a reallocation?

21:45 <headius[m]> hmm

21:45 <headius[m]> input.append will expand

21:46 <slonopotamus[m]> it obviously can't reallocate the way you wrote the code

21:46 <headius[m]> input is a ByteList, our growable collection wrapping byte[]

21:47 <headius[m]> I realize I am not checking it for nil though so I should allocate a new buffer in that case

21:47 <headius[m]> ahh actually this always allocates input

21:48 <slonopotamus[m]> oh, ok, I've misread this code **again** and thought it is appending to `next_in`. need some sleep)

21:48 <headius[m]> the logic in CRuby is goofy... they allocate this buffer and copy the incoming bytes to it and then proceed from there

21:48 <headius[m]> we mimic that but clearly didn't handle the extra part properly

21:48 <headius[m]> slonopotamus: if you get a chance to test out that patch let me know

21:49 <headius[m]> you can build from the branch on headius/jruby or wait until it merges

21:51 <slonopotamus[m]> I will hopefully be able to test it tomorrow. It's almost 1am here, not the best time for figuring out how to build jruby from source))

21:52 <headius[m]> yeah no worries... for future, just check out, "./mvnw", and run jruby from bin/

21:55 <slonopotamus[m]> 👍️

22:04 <headius[m]> enebo: we may need to be a bit more exclusive in the packaging of the stdlib artifact

22:04 <headius[m]> [INFO] Copying 43655 resources to /Users/headius/projects/jruby-9.2/lib/target/classes/META-INF/jruby.home/lib/ruby/gems/shared

22:04 <enebo[m]> wot

22:04 <headius[m]> because it needs to copy all the default and bundle gems, I made it copy everything... which is a bit of an issue when you have lots of other gems installed in a local repo

22:06 <enebo[m]> oh you mean compiling it locally from your dev env though

22:06 <headius[m]> yes

22:06 <enebo[m]> ok

22:06 <headius[m]> I wouldn't care but it is slow

22:06 <enebo[m]> yeah the workaround of having to always build from a shallow clone is not very appealing

22:07 <enebo[m]> Perhaps any exclusion somehow is mapped to -SNAPSHOT

22:07 <headius[m]> this doesn't really affect anything visible in a local dev env except copying a lot of crap

22:07 <enebo[m]> If possible

22:10 <headius[m]> ugh I forgot to push docker image

22:11 <headius[m]> we need a release manager

22:15 <headius[m]> https://github.com/docker-library/official-images/pull/9781

22:18 <headius[m]> aargh github UI and URLs changed mid triage

22:18 <headius[m]> apparently github.com/issues is a 404 now?

22:20 travis-ci has joined #jruby

22:20 <travis-ci> jruby/jruby (master:f69c08c by Charles Oliver Nutter): The build was fixed. https://travis-ci.com/jruby/jruby/builds/219809430 [206 min 26 sec]

22:20 travis-ci has left #jruby [#jruby]

22:20 <headius[m]> enebo: that means the default gem fixes are in snapshots now

23:18 ur5us_ has joined #jruby

23:20 ur5us has quit [Remote host closed the connection]

23:34 ur5us_ has quit [Ping timeout: 260 seconds]

23:39 ur5us_ has joined #jruby