Yes, I see that I missed two important aspects of programming the LPC1114: sector erase limit and block programming limit.
But, if I utilize the method I described above, where I repeatedly index my small data chunk into a block, taking the chunk from FF to data, then incrementing to the next chunk location in the block, each time programming but not erasing, then I do increase the endurance by the ratio of the chunk times the block. With a 8-byte chunk there will be 32 locations in the block, times 16 blocks in the sector -- we index inside the block changing the chunk from FF to my data, and the previous indexed chunk to 00, then program the whole block.
FFs onto FFs don't stress, and 00s onto 00s don't stress -- therefore, as you move from block to block with the chunk changing each time only from erased to programmed, each cell is only stressed once -- then erase the entire sector at the end, and start over.
That does change the endurance. By a whole lot. In the example I gave it goes from 10,000 to 5,120,000 times for each sector.